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<210> 1 

<211> 88421 

<212> DNA 

<213> Actinoplanes sp. 
<220> 

<221> mi sc_f eature 

<222> (2077) . . (3078) 

<223> ORF 1; positive strandedness 



<220> 

<221> mi sc_f eature 

<222> (3118) . . (4032) 

<223> ORF 2; positive strandedness 



<220> 

<221> misc_f eature 

<222> (4038) . . (5048) 

<223> ORF 3; positive strandedness 



<220> 

<221> misc_f eature 

<222> (6665) . . (5814) 

<223> ORF 4; negative strandedness 



<220> 

<221> misc_f eature 

<222> (7703) . . (6693) 

<223> ORF 5; negative strandedness 



<220> 

<221> misc_feature 

<222> (9464) . . (8130) 

<223> ORF 6; negative strandedness 
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<220> 

<221> misc_f eature 

<222> (9691) . . (10761) 

<223> ORF 7; positive strandedness 



<220> 

<221> misc_f eature 

<222> (12751) . . (10829) 

<223> ORF 8; negative strandedness 



<220> 

<221> misc_f eature 

<222> (13617) . . (12802) 

<223> ORF 9; negative strandedness 



<220> 

<221> mi sc_f eature 

<222> (15203) . . (13614) 

<223> ORF 10; negative strandedness 



<220> 

<221> misc_f eature 

<222> (15591) . . (15863) 

<223> ORF 11; positive strandedness 



<220> 

<221> mi sc_f eature 

<222> (15880) . . (19035) 

<223> ORF 12; positive strandedness 



<220> 

<221> misc_f eature 

<222> (19032) . . (39713) 

<223> ORF 13; positive strandedness 



<220> 

<221> misc_f eature 

<222> (39713) . . (65800) 

<223> ORF 14; positive strandedness 



<220> 

<221> misc_f eature 

<222> (65826) . . (66530) 

<223> ORF 15; positive strandedness 



<220> 

<221> misc_f eature 

<222> (66546) . . (67370) 

<223> ORF 16; positive strandedness 
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<220> 

<221> iuisc_f eature 

<222> (67384) . . (70059) 

<223> ORF 17; positive strandedness 



<220> 

<221> misc_f eature 

<222> (70099) . . (70662) 

<223> ORF 18; positive , strandedness 



<220> 

<221> misc_f eature 

<222> (70659) . . (71906) 

<223> ORF 19; positive strandedness 



<220> 

<221> misc_f eature 

<222> (73439) . . (71964) 

<223> ORF 20; negative strandedness 



<220> 

<221> misc_f eature 

<222> (74216) . . (73563) 

<223> ORF 21; negative strandedness 



<220> 

<221> misc_feature 

<222> (75424) . . (74213) 

<223> ORF 22; negative strandedness 



<220> 

<221> misc_feature 

<222> (75535) . . (76464) 

<223> ORF 23; positive strandedness 



<220> 

<221> misc_f eature 

<222> (78110) . . (76449) 

<223> ORF 24; negative strandedness 



<220> 

<221> misc_f eature 

<222> (79864) . . (78107) 

<223> ORF 25; negative strandedness 



<220> 

<221> misc_feature 

<222> (81624) . . (79861) 

<223> ORF 26; negative strandedness 
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<220> 

<221> misc_f eature 

<222> (81909) . . (81682) 

<223> ORF 27; negative strandedness 



<220> 

<221> misc_f eature 

<222> (82346) ,\ (82062) 

<223> ORF 28; negative strandedness 



<220> 

< 2 2 1 > mi sc_f eature 

<222> (82587) . . (84446) 

<223> ORF 29; positive strandedness 



<220> 

<221> mi sc_f eature 

<222> (84481) . . (85548) 

<223> ORF 30; positive strandedness 



<220> 

<221> misc_feature 

<222> (85556) . . (86845) 

<223> ORF 31; positive strandedness 



<220> 

<221> misc_feature 

<222> (87372) . . (86803) 

<223> ORF 32; negative strandedness 



<220> 

<221> misc_f eature 
<222> (87494) . . (88420) 

<223> ORF 33; positive strandedness; N-terminus only 
<400> 1 

ggcgaactgc ttgtcctcgc tcggcggcag gctgttccac ctgtccttct cggccatcgg 60 
cacgatcacc tcgttgaaga gggggttgcc gagccgggag acctgcacct gcgggccgac 120 
ggtcacatcg ccggaggagg agccgtcgcg gacctgaacc tgacggcgac tggccgaggt 180 
ccacaccccg atgaccgcgc ccgcgtcacg tccgcgcacc cgcttcttgc cgtcacggcg 240 
caccatgtgt acggggatct gcagcgcgat gctgtgcacg ttggtcttgt cggtggcgtt 300 
gaccgccttg ccggcgtagt tgaacaggtt ctgcccgacc aggtgcttgt cctggaacgg 360 
gcgcagcgtg ccgaggtcga agatggcgcc cagatcgacg aagaaggcgt cggcgcgctg 420 
gccggcgaag accttctcgc cggtcttcag cttgtgcacg gcgtcggcgg cgaggccgtc 480 
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gtagtcggcg atcgacacct tgccgacgtt gcacggcggg cagggcagct tgctcgccag 540 

caccgtgctc ttgccgtgct tgtcgacctt cgtcaccgag tagaactggc ggcgattcca 600 

gttctcgctg tcgagcgact cgatcgggcc ggtgttgtag aggaacgtct tgttgttgcg 660 

cagctcggtg cggaaccgga actggtaggt gatctcggcg cgggcgtcac cgtccgagtc 720 

gatgtggatc tcgtagacga cgtcgtcgcc gaactcgaag aagttcgggc cgctcgccgg 780 

cagctgcaac ggcacgtagt tggcgatcag ggtgaccgtg tcgggctcgt cggggctgac 840 

gaaggcgtac aggtcggagc tgtcggccac cgggtccttg ctgatctcgg gtgcttcgcg 900 

gtgtgaagac atgtcatgcc ctcccggcgg ccttgatcag gctcgcgacg agcttcttgt 960 

cggtgatcgt cttgagctgc tcgcccgaga agacgtcgat cgtgcccttc ttggcgtccc 1020 

ggatggagac gacgatcggg gagccgtcac cggtgggctc gttctcgccg gcgaggctca 1080 

gcccggcgac cgaggcggcc gatacgccga cgccggtcgc ggcgagggcg aggacccggc 1140 

gacgggaaag ccacggacgt gctgccgggg cgcgatagcg agtgcgactc atgctcgacc 1200 

tttcgtcttt cttacggtcg gtggtgcgag gactggaacg ggcgttgcgt gtcgacgccc 1260 

gtcagtacgt gatccaccgg cctgcggttc aaagacgaat tggagcgctc ccaacgggac 1320 

gactcgagcc gcgccggggc gggcacggca ccggccgtgc ccgccccgcg tgcgcggtca 1380 

ccgccagttg gcgtgatccg cggcggcggc gcggtcgcgc tcgtcggcgg cccggcggcc 1440 

cttgtcgacc tcgcgcatcg accagccgat cgcgacggcg gccaggtagg cgagggtgcc 1500 

gagcaccgcc acggcccgga agccgccggt cacccagcag gccaggaaca cggccaggcc 1560 

accggcgccg atgcccgccg cccacgagga cggcagccgg cggatcgccg agccgaccag 1620 

cacggccagg cccagcgacg cgatcgggct cggctcctgc ggcaggtcgg cgaagtggga 1680 

gacgagcacg gcgatcaggc cgaggcccac gcccgccgcc gcggtcatcg ccgggttgat 1740 

ccggcgggcc agggccaggc cggtcatcag cacggcgacg atcgtgtcga agaccgcgta 1800 

accggcgccc cagctgtcgg agagcatgta gacggtgaac gcgatgccgg ccagcgcgat 1860 

cagcccgaac accacgtcga aggcgacacc ggcccgcggg gccggccttg ctgaagtagt 1920 

catgccgccc acgctaggaa atcggcccgt ccggcgaacc ggtcgaaagt acggtcggcg 1980 

ccgcggcgaa ccctgctttg gaccgatgct cgcgacgggg tgcttccccg agggtgatgg 2040 

catgcgtttc taccacgagg tgcgccgatg atctggatga gctggcgcca gttccgctgg 2100 

caggccctgg ccggtgccgt cgccctggtg ccgttggtgg cctacttgat cgtcacgagc 2160 

ctggacatcc ggcgcgccca cgaccgctat caggcgcagt gcgcgtccat cggcaactgc 2220 

gccgaggcga tgctccagtt ccagaacgac ttccgcaccc gcctgctgct gctcgccatc 2280 
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ctgctggccg cgatccccgg catcctcggg gtgttctggg gcgcgccgct ggtggcccgc 2340 

gagctcgaga ccggcacgca ccgcctggtc tggaaccaga gcgtcacccg gcgccggtgg 2400 

ctggcggtca aggtgctgtt cgtcggtgtc gccgcgatgg ccgtggccac gctcgtcagc 2460 

acgctgctga cctgggcgag cagcccggtc gacgcggtgt cgcaggaccg gttcggcgcg 2520 

ctggtgttcg acgcccgcaa catcgtgccg gtcgcgtacg ccgccttcgc cctcgtcctc 2580 

ggcacggtga tcggcctgct cgtgcgccgc accatcccgg ccatggcgct caccatgctc 2640 

gtcttcgccg tcgtgcagtt caccgtgccg gcgctggccc ggccgcacct gatggcgccg 2700 

gagacccaga cccggcagat gacgttgcag gagttcggcg aggtgcgcgg cttcggcgac 2760 

gagcccacgg tcaacgggct gagcatccgg ggcgcgtggg tgaccagcac cagcccgctg 2820 

ctcaccgccg acgggacccg gctcgacaag gccacgtacc gcaaatgcgt gaccgacccc 2880 

ccggccgtct cgggcggagc tcccggcgtc ggcggcaccg tcgcctgcct ggccgacctc 2940 

gatctgcacg tcgaggtggc ctaccagccc aacgaccggt actggacctt ccagtggatc 3000 

gagtcggccc tctacctggc gctcggtgga ctgctcctcg ccgtgggcct gtggcgcatc 3060 

cgccgccacg tcatctgatc gtccccccgt ccgcacggat tcgaaggata gaaagacatg 3120 

ccacacgagg attcctcgcc cgttctgcag gcggagggct tgaccaaacg ctacggtcgg 3180 

cgcaccgccc tgcaggactg caacctgacc attccgcgcg gccgggtgat cggcctggtc 3240 

ggcccgaacg gcgccggcaa gtcgacgctg ctccagctgg cctgcgggct gatcacgccg 3300 

tcggagggct cgctgcgcgt gctcggcgag acgccggccg cgaacgccgg ccacctcgcc 3360 

aaggtcggct tcgtcgcaca ggacaccccg gtctacagca acttcacggt cggcgaccac 3420 

ctgaagatgg gtgccaagct caacccgacg tgggaccagg cgctcgccga gcgccgcgtc 3480 

gcgcaggtcg ggctcaacca cggccagaag gcgggccggc tctccggcgg tcagcgcgcc 3540 

cagctcgccc tgacgcttgc cgccgccaag cgcccggaac tgctgatgtt cgacgagccg 3600 

gccgccgcgc tcgacccgct ggcccgcgac ggcttcctgc agaacctgct cgagttcgtc 3660 

accgagctcg acgccagcgc gatcctgtcg tcgcacctgc tcggcgacgt cgagcgcgtc 3720 

tgcaactacc tgatcgtgct ctgcgcctcc cgggtgcagg tcgccggcga cgttcccgac 3780 

ctgctcaaca cgcactaccg catcgtcgcg ccccgcggcg agctggacca tccgccggcc 3840 

ggcctcgagg tcatccgggc gcagcacgcc gaccggtaca ccaccgccgt cgtgcgcggc 3900 

gacggcagcc ggccgagcac ctggacgatc gagcccatcc agctcgagga gctcgtgctg 3960 

gcgtacatga cgcgggcgat gggcgtcacc ggcgagccgc tgatggccgc gtccggggag 4020 

gtcgtccgtt gatctggatg agctggcggc agttccgcgg tcaggccgtc gtcggggtcg 4080 
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tccrtactcrcrc 


cctgctcgcc 


gcatacctgg 


tctacctcgg 


cgtcgacatc 


cacaacacct 


4140 


acgacgacta 


tcaacrccrcaa 


taccccgcgg 


gcggcgactg 


cgccgggccc 


ctgggccagt 


4200 


tcagcctcga 


ctacgagaac 


aegttgetet 


atctggccQQ 


catgctggcg 


ctggtgcccg 


4260 


ace tactcaa 

3 ^ ^ 3 ^ ^ 3 3 


catgttctgg 


ggcgcgcccc 


tgatcacccg 


aaaactagaa 


aacggcaccc 


4320 


agcacctaat 


gtggaaccag 


agcgtgaccc 


gccgccgatq 


getgetgate 


aagctactcg 


4380 


tcgtgggctt 


ggcctgcatg 


atagtaaccQ 


agcrtcrccaaa 


cctgctgctg 


acctgggccg 


4440 


ccacaccaat 


cgacaatgtg 


gccgacaacc 


ggttcagcac 


aataatattc 


ggagcccggt 


4500 


tec tgccgcc 


gatcgee tac 


accaccttca 


cat tcatac t 


cggcacgctc 


atcggcctgc 


4560 


taatccacca 


aacaatacca 


acaataacac 

Svywuyyvyv 


tcacgctcgt 


aacattcata 

33 w 3 ^* ^ 3 ^3 


atcttccagt 


4620 


tcctggtgcc 


aaacctaata 


cgcccccacc 


tcatgccggc 


caagcacctg 


gtcaagcega 


4680 


tgacggtgag 


cgccatcaac 


gaggecaagt 


cactaaacaa 

53 X-f 53 53 53 ^-r V** 53 


catcaccggc 


acaccaatac 


4740 


tgaaeggect 


gtcgatctcg 


caaaac t aaa 


tcaccgacgt 


cagcgcgc tc 


aagaccgccg 


4800 


acggccggtc 


gctggacgcg 


aagacgttcg 


acaactgeta 


catgaacgeg 


cccaagaccg 


4860 


atacaaccaa 


craacccatac 


ggtgacgtcg 


cggtctgcct 


ggccaagctg 


gacctgcacg 


4920 


tcgacatcgc 


ctaccagccg 


taaaaccaat 

vyy««v^yy 


actgggcett 


ccagttcctc 


gaatcggggt 


4980 


tctatgtgct 


actcaacaac 


ctgetgateg 


acaccacaat 


gtggcgcgtc 


caacaacaac 

^^y-^J 3^ 33^ 


5040 


ccagctgaga 


taaacacaca 


cagcaccgtg 


accaaaccaa 


gcgcactcgt 


ccgtggcacc 


5100 


caaaaaacca 




caacrccrtcat 


accacaataa 

y^*^*3^*33 ^"33 


ccgccctggt 


catcaacgag 


5160 




ggtaccgcac 


aatac taata 
yy u y^ v -yy '-y 


acactaaaaa 


acccaccaac 


cgacagcgac 


5220 


gaagcagt tc 


tcaaac taat 

»- « 3 3 3 *~ 3 3 


c tccc tgcac 


atcacacaaa 


aacacoaccrc 

53 53 53 ^ 53 >Mr 


accggtccat 


5280 


ctgtacggcc 


aggactacat 


aaaca tac tc 

53 53 53 N *-" 53 


aaaacccacc 


tggccgcacc 


ac taatccaa 

53 53 53 ' ^ 53 ^3 


5340 


crcot tccfcrca 

y vy ^— ' "^yy **y 


ccqgqq tcrat 
uu yyyy y v - 


gccggtccgt 


gcgccgctgc 


teg tec tgt t 


cacac tat tc 


5400 


ctggtcgtca 


tgtaccagct 


cacccaaaaa 

Vn* %p** 5-J 53 Vp#* 53 ^-*- 


ctataatcac 


caaacatcac 

N ^-' 53 53 53 Vp* 1 53 \p* 53 v 


aataacaaca 


5460 


crtcfctcrcftcrc 


tcaatc tcaa 


cgccgaccgc 


atactaaaca 

53 53 ^— ' w 53 53 53 


ccgagttgac 


aaccaacaac 


5520 


aaatacccQa 


aaatcaccat 


tLctLcaacacci 

*-* \m* \J V** \^ 


c tec tat tec 


tec tcacaat 


acaaatcaac 

y^yy *■* y ^ 


5580 


^y^yyyy^yy 


tccgccgccc 


ootactccrcc 


ctcraccaaat 


tcaaacttac 


getgggee tc 


5640 


aacatatcaa 


toataaccaa 


CC tC tLGCCCCI 


aacaacccfct 

yy^y w^/vy^ *— 


acaacac tac 


aaat t tc taa 


5700 


tggccgccga 


tctgtctcgc 


gctgctcgcg 


gtgacggccg 


ggccggcggt 


gcgtgcgctg 


5760 


cggcgcgccg 


ttccggagcc 


ggeggegate 


cggtcgcgcc 


cgccgacgcg 


eggtcagggg 


5820 


ctggcggcga 


tgcggtcgcg 


cccgtcgacg 


ccgcgctgga 


geatgeeggt 


gagegeggeg 


5880 



a. accrcc tc ccr 


c a c crcfC c c crcr 


atgggcc tec 


aactcaccrca 

« 53 53 53 


gegecgagat 


ggtcgccgcc 


5940 


a crc t c qqc o cr 


acaaatcatc 


cracrcaccf tea 


acaacccrcac 

53 53 w * N " p * 53 i3 


caacgttggc 
^ 3 3 3 33 


ggecaggate 


6000 


tccgtccaca 


gtgccgcgcg 


accgcccgcg 


atccgcgtgg 


tgtcccgcac 


gccctggccg 


6060 


gccaggccca 


getgggcegg 


ggtgccgtcc 


agcatccgcg 


cggccagcag 


gccggccacc 


6120 


aaatcrccraca 


cgtgcoaoac 

^ 3 lw -3 v ^3^*3^*^' 


gagggccacc 


acccgatcat 

y^-wwyy s^wy <— 


gctcctcggc 


gctcatcagc 


6180 


ace crcr cert crcr 


caccacacfGC 


cgaaaccacg 


accaacraccc 

53 ^ 53 53 53 Vp" 


catcaacgac 


agegatactcf 

3 3^33 wyv^\*y 


6240 


c t tcacccrcrc 


ccraacrcraQacf 

3 3 3 3 3 3 *-* 3 


cacccagggc 


cgcccctcga 


acagcgagcc 


ccagcrccgcg 


6300 


tcrccrcrtccgg 


agegt tege t 


geeggegagg 

y^.^.yy«_yuyy 


ggatgcccgc 


caacatggga 

^* 3 y w 3 3 3 


agaggegteg 

33^33^3 ^3 


6360 


cagccgagca 


cctcgatctg 


cegggaggge 

^-"^'yyy , -*yyy^- 


agcactttca 


cgctcgccgc 


atcggtgtgc 
v> w ^ 3 3 w 3 3 


6420 


acccgaacco 


ttccccgccg 


ctggaggtcg 


accagcacga 
3 3 ^33 


gagegaegge 


cgccggcggc 


6480 


accgcgatca 


ccfcrccraccftc 

^* 53 53 % "* 53 ^* 53 ^ 


cacccgggtg 


cgcggctcgc 


ccgcgatgcc 


cgcgccgagc 


6540 


crc cere cere crcr 


coc crc crc acre 


a crc crcr era t c cr 

y y ^ 3 y y , — y 


gcgtccagca 


aa tcraacerat 

y y <-y *-y y »- 


gatgtcacgt 


6600 


t crcrcr t cracraQ 

3 3 3 3 3 3 


caaoaccaac 

^y My y • w y s-a *— 


caaaatccca 


atcaaccccfCT 

%p* w *i-*53 ^— ' 53 53 


tacccracgac 


gaccgcactg 

yv*^-^-y^-y*— ^ 3 


6660 


ccicacaaaGQ 

y w wv v»*L«iy y y 


ctcccatcgc 


cgcat tege t 


attcacfcrcca 


cgttcttctc 


aagtgtttcg 


6720 


gcgaattcca 


accactcttg 


tgaacaaege 


cgggccacgt 


tcgccaggac 


gtacgtgcaa 


6780 


tgcgacggaa 


c c crc cr t c gag 

53 53 53 *■* 53 


catgtctttc 


cactcatcat 


ettgaacegt 


gegggcaaga 


6840 


acc cat coca 


aaagat tgcg 


accaaa t tea 

53 N — ' 53 53 ^* 53 


gtaaatcgea 


crccraccrcrgtc 

53 53 ^* 53 53 53 *-* 


gttcttcaag 


6900 


cc t tgaagta 


ttgaccgcac 


cratcoac tea 

yy *— v y y v^^-y 


ageagecgaa 


tcgtgttttc 


cttgtcgaag 


6960 


gaaatgtcgt 


cgcggtgacc 


crcccotcrcocT 

53 53 w 53 ^3 53 


cacrcrcrcrccacf 


acaacaaatc 

3 ^"'-*y y y y v- 


ctcgccgcgc 


7020 


tcrcacfcrccrat 


tgcgcacgtc 


acgcgcggtc 


cracracrccrata 


cgccggcgtt 


ettggegate 


7080 


tcccacaciaQ 

>■» 3 "-3 3 3 


aaacatcaaa 

«y 3^3 3 3 3 


c tcrttcrctcra 


atgtagctga 


ccrcrccttcacr 


tcggccctcg 


7140 


gaattgtcca 


acaaacocac 

y wy yy vy vvav 


cccroccotcc 


cggccgacgc 


aaatcrccfccra 

yyy »-y ^— y v^y*-* 


gccctgcccg 


7200 


atcftccrccQcr 

LA l~ y V* y \— y y 


aaacrctccracr 

53 53 53 53 


crcaccaacat 

53 53 53 53 53 ^* 


atgttgccca 


caatcrcaaac 

^-yy yyy»— 


gctgagtccg 


7260 


c tcrcracrcrcccr 

^"3 3 t "* 3 3 ^ ^ 3 


c tattatcrcQ 


atctaaccaa 

y L- v»- *^ y w ^-y 


gaeggatgeg 


ac tcgatgat 


ccgttccgcg 


7320 


o c* cr c ct c cr t cr c 


cr cr t c c cr c t crfc 

yy i». ^ v»» y ^ t» y i— 


cr era c a cr t cr crc 

53 53 53 53 53 


acrcrc cert crc cr 

oyy v^<y ^y\_^y 


caatcrttccrc 


cttcfaccrcca 


7380 


acrcraccraatcr 


cc tec terete 


actaccctca 

y * — y v-' v v — i~ y 


aacatcgegg 


cc ttgatcag 


ctccftcacca 


7440 


cctc acre ccr cfcr 


cccfccrcccaCT 


c c crcr t cr c cr c cr 

v— • yy ^yv^y^y 


ccgtcgatga 


cccgcatgct 


tacacacrtac 

u<y\_rywyy ^yv^ 


7500 


acgatgatcg 


gcggcagttc 


ggegtcgagg 


ctggccagca 


tctgggtgtg 


ccgcggatcc 


7560 


tcgccggcca 


gccggggtga 


atcgagcatt 


tccagtgaat 


ccacaggtag 


ccactcgacg 


7620 


cgcatcattg 


acggagtgtt 


catctgtgtt 


tcgtcgtgcc 


gatccggctc 


gtggcgcgct 


7680 



gacgcgatgt gcaatgactc cataccaacc cccgaatggc tggactccga tgaccgctgg 7740 

acacgatgca ccgcaggtct caacgagatg ctttgacgac cgtcggccgg ctcggccgtc 7800 

gtggccgagt gcgcttgtgc agcgggcttt caatgtccga cactaatcga ggcggcgctg 7860 

gctcgccaag ctcgtaagag gttctctgac gcatttttaa gctgtcttca cggtgaagtt 7920 

tatcgatccc acggtgggtc ggtgggcgtc gagcgggatc tgatcgccac gatgcgtcat 7980 

cgtgtccgtg ccgttcgtac tggtcaaccg ggagatctcg ggggtgccac ggtggacgtg 8040 

ccggccgggt ccgggccgcc ccggccgagg gcgccgggga cgggttcggc cctatttctg 8100 

caagttgcag actgtgcgat tcgggtgggt cacggccgga gtgccgcgat ctcggtcgtg 8160 

atgaagcccg ccagccgctt gacaccctcg tcgatctcct ccggggtgag gtagctgatc 822 0 

gacaggcgga tgccctgttc gccgccgccg gccgggtaga agtacgacat cggcgtccag 8280 

atcaccccgt ggtcctcggc gcttcgggac agggcggcgt tgtcggcggt gaagggcacg 8340 

ttcacggcga ggaagaagcc gccgctcggg cggttccagc gcacgcccgt gcgtgcgcgg 8400 

aaggacgccg gaaggtgctc ctcgagccgg tcgagggtgc gccgcatcgc cgcgccgtag 8460 

tgcgccgagc tctccgcact cgcctcggcc gcggtgccgc cggccgcgag cagcatcccg 8520 

gccacggcgg cctggctcag cggcgaggtg ttcaccgtga ccatgctctt caccttcgcc 8580 

agctcgtcgg cgagcaggcc ggcgccgccc gcggcgtccg acaccggctg gtcggcgatc 8640 

gcgaagccca cgcgcgcgcc cggaaagagc gtcttggaga acgatccgag gtggacgacg 8700 

tgccggcccg ggtcgagggc cttgagggag ggaagctgct gccccgggct gacgagccgg 8760 

tacgggctgt cctcgatcac cagcaggccg agctcgccgg cgaggtcgag cagggcgtgg 8820 

cgggcctcca gcggcatcgt ggcgcccgac gggttggtgt ggtcgggcac gacatagaag 8880 

gcgcggggac gccggccccg gctcagctcc gcgtggaccg cgcgggccag gtcctcggga 8940 

tggaagccgt cctcgcgctc ggcgacgggc accgggtcga tgtcgagcag ccgcgcggcg 9000 

ccggtgatgc cgacgtagca cgggctcgcc acgaacagcg cgtcgcgctc gtcccggatc 9060 

agcgcgcgca gcgccagcag catcgcctcc tgcgcgccga ccgtcacgac gatggactcc 9120 

ggagccacgt cgatgccctc gtcgcgccgc agccactgcg cgatcacctc gcggatgcgg 9180 

ccggccgccg ggccgtactg gaagacggcg tccctgatct ccgcgggcga gcggccctgg 9240 

ccggcgaggt gctcgagata gccgcggatg ccgcggaaga tctgctcgac gtcgaagaac 9300 

ccgtcgaacg ggcggccggg cgcgaacgag acggctcgcg gatagcgggc ggtcacctcg 9360 

ttgaggaagt tcatggtgtc cagcagcggg tcggacaggc tctggtgcag gtcctcgcgc 9420 

cgcaggacgc ggccggtgcc cggagcctcg cgcaggatgc tcatcatggt ctcctcggtc 9480 
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gtcgcgaagc cctgactgtg gcaccgcatt ccgcaccgat caatcgaagc cacggcggtg 9540 

accgcacacc tgtgcaactg ccgcgaacgc acggttggac accgtcggtg agctgcgttg 9600 

gtggacggtt cggccgacgt ccatagtggt ccgggggctc gctcggccgc ctcgccggcg 9660 

cctccaggcc gccagcgaaa ggacgccgcc gtgaccgcca ccgccctcct gcccctgacc 9720 

ctcgcggact acgaacagct ggcccaagcg cgaatggagc ccccggtgtg ggacttcatc 9780 

gccggcggcg cgggggagga gctgacgctg gccgcgaaca ccgccgcctt cgcaccgccg 9840 

cggctgcggc cacgggtgct gaccggcgcg ggcgcgccgg acacgggcac gacgatcctc 9900 

ggacggcggt gggcggcgcc gatcggcgtc gccccgctcg gctatcacac gctcgtcgac 9960 

ccggcgggcg aggtcgccac cgccgcggcg gccggcgcgg ccgggctgcc gctcgtggtg 10020 

agcacgttct ccgggcggac cgtggaggac atcgccgcgg ccaccaccgc gccgcgctgg 10080 

ttgcaggtct attgcttccg cgaccgggcg gtcaccgccg cgctcgtcac gagggccgtc 10140 

cgcgccggct tcgaggcgct ggtgctcacc gtcgacgcgc cgcggctggg ccgccgcctg 10200 

cgggacatcc gcaacgactt ccgcctgccg cccggcgtgg cgccggcaaa cctcaccggc 10260 

gacggcttcg cgtcgcccag cgggcacgcg ctcggcgcgt tcgacgccgc gatggactgg 10320 

accgtcgttg cctggctgcg ggagctcagc gggctgccgg tgctgctcaa gggcgtgctg 10380 

accgccgacg gtgcccggcg ggcgctcgac gcgggtgcgg acgggatcgt cgtctccaac 10440 

cacggcggcc ggcagctcga cggcgtgccg gcgacgctcg acgtgctgcc cgaggtggtg 10500 

gcggccgtgg ccgggcgctg cccggtcctg ctcgacggcg gcgtgcggcg cggccgcgac 10560 

gtcctgctgt cgctggccct cggcgccgac gcggtcctgg taggccgccc ggtgctgtac 10620 

ggcctcgcgg tcggcggcac ggccggcgtg cggcacgtgc tcgacatcct cgcgggggag 10680 

ctgaccgacg acatggccct ggcgggcgtg gcctcgcccg cggacgccgg cgcggacctg 10740 

gcgggcccgg tcgcgccgta gaggcggtcc atacgactgc ggcggccggg aatacccggc 10800 

cgccgcaacg tcgtcctcca gggacatctc aggacggcac ggccggctgg cgctcctcgg 10860 

tgcgctggcc ggcgaactgg gtccggtaca gctcggcgta caggccaccg tgcgcgatca 10920 

gctcgtcgtg ggtgccgcgc tcgacgacgc ggccgtcgtc gatgacgagg atctggtcgg 10980 

cgtccaggat cgtggcgagc cggtgggcga tgacgagcga cgtacgcccg gcgagcgccg 11040 

tgtccagggc ccgctggatc gccgcctccg attcggagtc cagatgggcg gtggcctcgt 11100 

cgagcaccac caccgggggc gatttgagca gcaggcgggc cagggcgagg cgctgcttct 11160 

cgccgcccga gagccggtag ccgcgatcgc cgaccaccgt gtccagcccg tcgggcagcg 11220 

aggacaccat ctcccagatg cgggccgcct cgcaggccgc gacgaggtcg cgctcgccgg 11280 

10 



cgtcgggacg gccgtagagc aggttggccc ggatggtgtc gtggaacagg tgcgcgtcct 11340 

gcgtgaccac gccgatggac tcgctcagcg agcgcagggt gaggtcgcgg acgtcgtggc 11400 

cggcgatccg gaccgtgccc gaggtggtgt cgtagaggcg cggcaccagg tgggtgatcg 11460 

tggtcttgcc ggcgcccgac gggccgacca gggcggtgag cttgccggcc ggggcgagaa 11520 

agctgatccc gttgagcacc ggccgctccg cggtgccgtc ggaggaccgc tgggccacgg 11580 

tctccaacga ggccagcgag acctcgtccg cgcccggata gcggaacacg acgttgtcga 11640 

actcgatgtc cggcgccgcc gagcggcccg gctcggcggc cggcagggcg cgggcgcccg 11700 

ggcgctcctt gacgagcggg tcgaggtcga gcacctcgaa gacccggtcg aagctcacca 11760 

gcgcggtgac gacgtccacc tggatgttgg tgagctggtt caccgggccg tacagctgcg 11820 

ccagcagggc gaccatggcg acaagggtgc cgatgccgag cgtgccgtcg atgaccaggg 11880 

cgccgccgaa gccgtagacc atggccgtgg tcaccgtggt cagcagcgtg gcgatgatga 11940 

acagcagccg tgcgtgcacg cccatcgaga tggcgatgtc acgcacccgc gcggcccgcc 12000 

cggcgaaggc ggtctcctcg ctctccggcc ggccgtagag cttgaccagc atggcgccgg 12060 

agacgttgaa ccgctcgttc atcatcgagc ccagctccgc gtcgacctgc atgccgccgc 12120 

gggccagccg ctccagccgg cccgcgatga gcttgccggg caggaagaac agcgggatga 12180 

gcaccagcgc caccagcgcg atcgcccagg agaggtagaa catcgcgccg atgaccagca 12240 

cgacggtcag gaccgtggag accgtctgcg tgatcatcga ggtcatggcc tgctcggcgc 12300 

cgaccacgtc ggtgttgagc cggctcacca gcgaccccgt ctgcgcccgg gtgaagaacg 12360 

ccagcggctg gcgcatgacg tgggcgaaca ccttggtgcg caggtcgtag atgagaccct 12420 

gcccgacgcg cccggaggcc agcgtctgca cgtggatcgc cgccacgttg accagggcga 12480 

ggccggccac caccagcgac agcccgacca ccacgtccag gcggccggcc acgatgccgc 12540 

ggtcgatgat ctgcttgagc agcagcgggt tggcgaccgt gagcgcggcg tcgaccacgg 12600 

tcatcagcag ggcgacggcc agggaccagc ggtgtttctt cgcgtacggg aggatgcgcc 12660 

gcaccgtgcc gggcctgacc ttctgcatcg gtatgaggcc gtcgacgcgc agcccgatga 12720 

cgcccagatc cgggccgtcg atggttgcca cggaaccttc tctctcgcgg tagcgggtgc 12780 

gccggtcgtg ccggcgcgtc cctagtggga gtcgaggaac tcgaggatcg cggcgttcac 12840 

cgcgtccggc cgttccagat agccgaggtg cccgcagcgg gatatctcga ccaggtcgca 12900 

gtcggggatc gcctcggcca cctcggcggc caggtgcggc ggcgtgatga ggtcgtcggc 12960 

gaacgagatc acccggcagg gcgcggtgac cgagcgcagg gcggcgcgcc ggtcaccggt 13020 

cagctcggcc caggcctgcc cgccggccga gacgcccgtg ccggagagct cgaagatgtc 13080 
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gagccactcg cgcacggcca cgtcgtcgtt gagcgtcgcc ggggagaaca tcttgaagac 13140 

cgcggtcgcg gcctcgtacg cggcgggcag cgtgaccccg ctctccagca ggtcgatgtc 13200 

ggcctggttc tgcgccgcgc gggccgcgtc gggccgggcc agggtcgcga tcaggaccgc 13260 

gcagcgcacc agctccggat ggtcgacggc cagctcctgg gcgatcatcg cgcccagcga 13320 

ggtgccgacg atccggcagg gcgcgagatc gagggcctcg atgagaccgc gggtgtcggc 13380 

ggtcatgtcg gcgagggagt acttgccggc gggcacgtcc gagggcggga tgccacggct 13440 

gtcgaagacg acggtggaat atcccgcctc gtgcaacgcg ggcgtctggt gcaccgtcca 13500 

ggtctggccg gccgagcccg atcccatgat catgagaacg ggctcgccgg ctccggagcg 13560 

gcgatacgcc aggcggaccc cgttggtggt gacgaaaccg gagcccgcgg cgctcaccag 13 620 

cgccactcgc gcttgttgaa caggtgctcg gcggtgaagc cccggtccgc gcaccacgcg 13 680 

aggaactcgt cgatctgctt gagctggtag gtgtcctcgt tgtaggtcgt cgccatgacg 13740 

tggccctgcc aggactcctc gcccatggcg tagatgtatc cggcggtggc gccgagctcg 13800 

gtcatgatcg cgccggcctg ctccgcgttc gatccggaca gccgccgtga ggcgctcatc 13860 

ttcttgttga ccggcttggt cagcagtccc ttgtagagcc agttgagcgg cgcgccgtcg 13920 

cactccatgc cgaggaacgc catgtcgacc tggccgacgt ggtcgcggat gtagcggtag 13980 

agcaccgggt cgatgcccga cgagtcggcg ccgatgaaga tggtgcgccc ggccatccgc 14040 

acgaagtacg tcgacttccc gcggatgtcc agatcggcgt gctcgcccag gaacggcgtg 14100 

gcggtgaccg tgccgcccgg gaacggcacc tcgtcgaact cctcgacctc gacgacggtg 14160 

aaaccgatcc ggcgcaggta cagcgcgatc gacgggtccg ggaggttgcc gcggctggtg 14220 

cgcggcacga caacggtgcc gatccggccg cgcagctgca gcagggtctc gagcacgatg 14280 

tggtcctggt gcccgtgcgt gatgagcacc aggtcgatgt ggtcgggcag gtcgtcgagc 14340 

gtgtaccggt cgccgtggcg gttgtcggtg ctgatgaacg ggtcggtgac caccgccgcc 14400 

tgctcggtct gcatcaccac gcaggcgtga ccgtagtagc ggacccggcc gccggactcg 14460 

atgtgccggt cgggcgacag gctcggctcg tcggtgagca gggcatccag cgcggccgcg 14520 

ccggcgtcgt cgagctccag cgcctcgcgc aggcggttga gcgaggtggg ccgcacgcgg 14580 

gcgtcgaaga gctcggtcag cccgggatgc cgcaggggca gcggaaggtc gagcacgcca 14640 

gcccggggaa gccgcggtgt gctgaggatg aacgggcgct ccaccccgtc gtcgagcgag 14700 

agctgcaccg actgccgccg ctcgtcgtag gccgggctgc ggtagagcag cggctcgagg 14760 

aagtgcagcg agggctggtt gctggtgtcg tacgcgatct ccaccagccc gttgagcgcg 14820 

gccggcagcc gggggtagag cggcgtcagg tcgtagccga ccgcgttctc gcggatcagg 14880 
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tcctcggcct cggccaccgc ggcggcgaag ccgagcatgt cggcccgctc gttcttgatc 14940 

gtcttgagca gctcggcgac ctcgtcgctg cgggattcct cgacggcgac gaagtagccg 15000 

ccgcgcatct tcgggttcgt gctcgccgcc acgtgcaccg gcggcgactg caggtacgac 15060 

tcgagcagag gcacctgcag gaacgccagg ttcatggagg ccggcaccgg tgccaccgtg 15120 

tgcagccacg cgtagaaacg gtcgacgagc ggttcggcga tgacgttgga gcgcaggtat 15180 

ctcggctgat cggcgttgcc catgtgaacc tctccatcct gatggcgggt gcgcccggag 15240 

cgggcacgat cgggcgggag ggaatgcggc ctcggtacgg ccggcgcggc aaacggcgcg 15300 

tccgtccctc cggcgtgggc gcggtccact atggcatcgg cccaggagcc ggtacagcgt 15360 

gcacgtgcgc ggcccgatcg cgtcgaacgc cctgtacaac cggcggtgcg acgcagtcga 15420 

acagcggtga acgaccgagg cggcggcggc ccgttgatcg gccgcccgag cgcccgcata 15480 

gtgaatcgcc ggaggggcct tggagagccg gcccgccgtt cccggccatc gccgggcgcc 15540 

cgccctcaca aaccgatcgc ccggcgccgc cgggtcgcac ggaggtgccc atgtccgaga 15600 

ccgacctgtc cgccgcccgg cacacgcccg agcagatccg ctcctggctg atcgaccgga 15660 

tcgcctacta cgtgatgctg ccgacccagg agatcgagcc ggacgtgtcc ctggccgagt 15720 

acggcctgga ctcggtgtac gcgttcgcgc tctgcggcga gatcgaggac acgctcggca 15780 

tcccgatcga gccgaccctg ctgtgggacg tcgacaccgt cgccaccctc accgcccacc 15840 

tcgccgaccg cgtcaaccga taagccggag gtgaccgtcg tgcccacccc tgacctgcgc 15900 

ccgctcacgc ccgcccagct cgccgtctgg cacgcgcagc agctcgcccc gcacagcccc 15960 

gtctatcagg tcggcgagtt cgtcgagatc gacggcgagt gcgaccccga tctcctggtg 16020 

gcggcgttgc gtcaggtcat gggcgaggcc gagagcgccc ggctgcggtt ccgcgtgatc 16080 

gacggtacgc cgtggcagta cgtcgccgag gacggcgacg acccgatcca ggtcgtggac 16140 

ctcggcgcgg ccgcggaccc gcgcgccgcg gcgctgggcc gcatggcggc cgacctcgac 16200 

cggcccggcg acctgcgcga cggcccgctc gtcgagcacc acgtctacct gctcggcgag 16260 

ggccgggtca tctggtacca ccgcgcgcac cacatcgtct gcgacggcgg cagcctcggc 16320 

attgtcgcct cccgggtggc cggcgtctat tccgcgctcg cggccggtgg tgacgtccgg 16380 

ccgggtgcgc tgccgccgct gtcggtgttg ctgtcggccg ccgacgccta cgagcgctcc 16440 

ggcgaccgcg accgggaccg cgagcactgg cgctccgcgc tggcgggcct gcccgccgag 16500 

ctgctcgcgg gcgcgggccg gccgcggccg ctgcccggac cgccggtgcg ccacgagcac 16560 

gacctctccg cggcggaggc gggccggctg cgcgcggggg cgcggcggct gcggaccagc 16620 

gtggcgcagg ccggcatcgc ggccgcggcc ctctaccagc accggctcac cggcgcccgg 16680 
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gggcatccgc gggtagcggg tgcggcggtg gtggctcggc cggatgcggt gggtgagcgg 18540 

cagttggtgg cctatgtggt cgctgccggt gagccggccg ggttggcgga gtcggtacgc 18600 

gcccacgtcg ccgagcgcct gcccgaatac atggtcccgg ccgccgtcgt gaccctcgac 18660 

gagatcccgc tgaccgtgaa cggcaaggtc gaccgcgccg ccctgcccga gcccggcccc 18720 

gtcgccaccg gcaacgccga ccgcgagccc acgaccgagc gcgaatcgct gctctgcggc 18780 

gccttcgccg acgtgctcgg catcgagcgg gtcggcgtcg acgacgactt cttcagcctc 18840 

ggcggccatt ccctgctcgc caccagcctg gtgagccggg tacgcctcgt gctcggtgag 18900 

gaactgccca tcgaggagct cttcgccacg cccacccccg ccgagctggc ggcctggctg 18960 

caacgcaacg cggaccggcc gcaaccggcc cggccggcgc tgcgcccgat gcacgaaagg 19020 

gaaacgaccg catgaccccg atgtcgtacg cccagcgccg tctctggttc cagctgcggg 19080 

tcgagggccc cgacgccacg tacaacagtc ccgccgtcct gcgcctcacc ggcgagctcg 19140 

acaccgccgc cctggagcac gcgctgcgcg acgtcctcga acggcacgag gtcctgcgca 19200 

cggtctatcc cgacgtcggc ggcgagccgc ggcagcgcgt ggttcgcccg gacgacatgg 19260 

tgtgggagct gcccacgacc cgggtgtccg gtgccggcgc gggcgacgac cggctcgtca 19320 

cgctcgacga gctgccctgg gaccgcccgg tgctcgacct gccgtcgccc gcaccggccg 19380 

gccgggaacc ggacggcgag atcaccgtcg acgagctgcc cggcgcgatc gcccgggtgg 19440 

cggcccaccc cttcgacctc tccatcgaga tcccggtgcg ggcgcggctg ttcgccctgg 19500 

gcccgcggca ccacgtcctc gtcgtcgtgc tccaccacat cgccaccgac ggcagctccg 19560 

gcgggccgtt cgcccgcgac ctcgccgccg cctaccgcgc ccgccgcacc ggaacggcgc 19620 

cccagtgggc accccttccg gtgcagtacg ccgactacgc ggcctggcag caggaactgc 19680 

tcggcgccga ggacgacccc gacagcgtca tctcgcggca gctcgcccac tggcaggagc 19740 

ggctggccgg catgccggtc gagctggatc tgccggccga ccgtccgcgg cccgccgaac 19800 

ccgggcacgg cgggcacacc aaggcgctga gcctgccccc ggccgtgcac cgaggactgg 19860 

ccacgctggc ccggcggcgc cgcgccacgc ttcaaatggt cgtgcagacc ggcgtcgcga 19920 

tcctgctgtc caagctcggc gccggccgcg acgtcccgct cggcatcccg gtcgccgggc 19980 

gcaccgacgc cgcgctcgac gacctgatcg gtttcttcgt caacaccttg gtcgtacgcg 20040 

ccgacctgtc cggcgacccc acggtggccg acgcgctggg ccgggtgcgc ggaggcgcgg 20100 

tggccgccct ggccgaccag gacgtgccct tcgacaaact cgtcgagcgg ctggcgccgg 20160 

cccgcgtgct cgggcggcac ccgctgttcc aggtcatggt cgcgccgctc gacgacggga 20220 

cgccgatcga cctggacggg gtgcgaggcg agccgctcac catcggccgc tccggtgcca 20280 
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agttcgacgt cgaggtgatg accggagagg tacgcgcggc ggacggcgcg ccggccggca 20340 

tccgcggaat cctgacgctc tcggccgacc tgttcgacga ggcgacggcc ggccggatgg 20400 

ccgccgggct ggtgcgggtg ctgaccgcga tggccgaggc cccggagcgg cggctctccg 20460 

gcatcgaggt gctgtcgccg ggggagcggt cgcggctgct ggtggagtgg aacgacacgg 20520 

ctcgtccggt ggtggagtcg tcggtgccgg cgttgttcgc gaagcgggtg gcggccacgc 20580 

cggacgcgac ggcggtggtg ggcgagggtg tgtcgtggtc ctatcgcgaa ctcgatcgtc 20640 

gctcggatgt gctggcgcgg cgtctggtgg cggcgggtgt gggtgtggag tcgccggtgg 20700 

tggtggcgtt ggagcggtcg ccggaggtgc tgtccgcgtt tttggcggtg gcgaaggccg 20760 

gtggtgtgtt tgttccggtg gatttgtcgt ggccgcaggc gcgtgtcgat gcggtggttg 20820 

ctgactgtgc cgcgcgggtg gcggtggctg accggccgat gagcgggctg acggtcgtgt 20880 

ccgccggcct gggcggggat tcggccgtcg tgtccgccga cctgaccgcg gatcgggccg 20940 

ttgtgttgcc gtctcgcccg gtgccgggtg cggcggtcta ccggatgtac acctccggct 21000 

cgacgggccg ccccaagggt gtggtgacca cccaccagaa cctggtggat ctggcgaccg 21060 

acacgtgttg gggtccgacc ccgcgggtgc tgttccacgc cccgcacgcc ttcgacgcgt 21120 

cgtcgtacga aatctgggtg cctctgttga atggcggcac ggtcgtggtc gccccgcagc 21180 

gcagcatcga cgccaccgtc ctgaaggatc tgatccgcgc gcatgatttg acgcacgtgc 21240 

acgtcaccgc gggcctgctg cgggtgctgg acccgtcgtg cttcgcgggc ctgaccgagg 21300 

tgctcacggg tggcgatgcg gtgtcggcgg aggcggtgcg ccgggtcaaa gacgcgaacc 21360 

cgggtctgcg ggtgcggcag ctgtacggcc cgaccgaggt gacgctgtgc gcgacgcagc 21420 

atctgctgga tgacggggtg ccgatcgggc ggccgttgga caacacccgc gtgtatgtcc 21480 

tggacgactt gctgcagccg gtgccggtcg gtgtgaccgg cgagctgtat gtggccggtg 21540 

ccggcgtggc gcgtggctat gcgggcatgc ccgggttgac ggccgagcga tttgtcgccg 21600 

acccgttcaa caccggcggt cgcctctacc gcacgggtga tctggtgcgg tggaccgatg 21660 

acggtgtgct gcatttcgcc ggccgagccg atgatcaggt gaagatccgc ggctatcggg 21720 

tggagccggg cgaggttgaa gcggttctgg ctcaacatcc cgatgtcagc caggtcgcgg 21780 

tggtcgtccg ggaggacacc ccaggcgaca agcgtcttgt cgcgtatgtc gtcggcggtg 21840 

acatcgaggc gtatggccag gagcgccttc ccggttacat ggtcccgtcg gcattcgtac 21900 

atctggatgc gttgccgctg accagcaacc agaaggtgga tcgggccgca ctgcccgcgc 21960 

cgtccatgga gtccggcgcc ggtcgagctc ccgcggatgc ccgcgaggag ttggtgtgtg 22020 

ccgcgttcgc cgaggtgctc ggcctggatc gggtcggtgt cgacgacgac ttcttcgctc 22080 
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tcggcggtca ctcgctgctt gccgtctccc ttgtggagga tttgcggcag cgtggcctgc 22140 

acgtctccgt tcgcgcgctc ttcgccacgc cgacccccgc ggcgctggcc gtctcgaccg 22200 

tggcggcccc gatcgaggtg ccgcccaacc tcatccccca gggcggcgcc cgggaactga 222 60 

cccccgacat gctgcccctg gtcgacctga ccggcgagga gctggccacc atcgtggccg 22320 

cggtgcccgg cggcgctgcc aacatcgccg acatctaccc cctagccccg ctgcaagagg 22380 

gcatcttctt ccaccacctc atgaccgagg gcgacaccgc cgacgtctac gcgctgccgt 22440 

acctgctgcg cgttggcacg cgtgagcagc tcgacgcctt cctcggggct ttgcagcagg 22500 

tggtggaccg ccacgacgtc taccgcacgg ccatcgcctg gcagaacctg cgcgagcccg 22560 

tccaggtcgt gcaccgccac gccaccctgc ccgtcaccga agtcaccccc gaccaactgc 22620 

acgccgccgc caccggcggc cggctcccgc tcgaccacgc acccctgctc agcgtccaca 22680 

tcgcacccga acccgacggc ggctggctcg ccctactgcg catgcaccac ctcgtgcagg 22740 

accacaccgc cctcgacatc gtcctcgacg agatccgcac catcctcgcc ggcgcaaccg 22800 

accacctccc cccgcccgta ccgttccgca atttcgtggc gcgctcgcgg cgtggtgccg 22860 

ccgaggccgc gcaccgcgac tacttcaccg gcctgctcgg cgacgtcacc gagaccaccg 22920 

ccccgtacgg cctcaccgac gtgcacggtg agcactccgg cgtgcgccgg ggccggctcg 22980 

ccgtgtccgc cgggctcgcc ggccgggtgc gggagaccgc tcgcgaccgg ggcgtcagcc 23040 

ccgccaccct cttccacctg gcctgggcgc gcgtgctcgc cgccgtctcg ggccgcgacg 23100 

acgtcgtctt cggcaccgtg ctgctgggcc ggatggatgc cggcccgggc gccgaccggg 23160 

tgccgggcct tttcatgaac accctgccgg ttcgcgtacg cctcggcggc cgcaccgtcg 23220 

acgaggcgct gcacgggatg cgcgcccagc tcgccgacct gctcacccac gagcacgccc 23280 

cgctggtgct cgcgcagcag tcggccggcc tgcccggcgg cagccccctg ttcacctcgc 23340 

tgttcaacta ccggcacaac gcgaccgaca tcgagcgctc cggaacgggc atcgacggcg 23400 

tcgaggcgct gcccaccggc gacccgtcga actatccgct cgacgtctcg gtcaaccaga 23460 

gcccgctcgg cttcgagctc gtcgtcgagg ccaccgagcc ggccgacccg gaccagctct 23520 

gccggctcct gcacgcctgc ctcgacgacc tgatcgccgc cctcgacgag cagcccggcc 23580 

gcgcgctcgg caccctcgac gtcgtcgccg gacgggagcg cgacctcctc ctggacggct 23 640 

ggaacgccac ggcggtcccg gcccagccgg ccctggtgcc ggagctcttc acggcgcagg 23700 

cggcgcggac acccacctgg ccggcgctgg tgacggccgg cgcggagatg tcctacgccg 23760 

agctcgagga gcggtccaac cggctggcgc gctggctggc cgggcgcggg gtgggcgccg 23820 

acgaccgggt ggccctgatg atgcgccgcg gcccggagct gatggtggcg atcctggccg 23880 
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tgctcaaggc gggcgcggcg tatctgcccg tggatccgga cctgccccgc gaccgggtgg 23940 

actacctgct cgcggacgcg gccccggcgt tcgtgctggc cgagcgggcc accgcgccgt 24000 

gggtgccggt ggccggtggg attccggtgc tggtcgtgga cgcgccggcc gtcgccgccg 24060 

aggtggcggc ccactcgggc gaggccgtca ccgaccggga ccggcgcgcc gccctgcgcg 24120 

gcggccacct cgcctacgtc atctacacct cgggatccac cgggcggccc aagggcgtgc 24180 

tgatcacgca cgacggcctc gccaatctca cgctcgacca cggccggttc ggcctcggtc 24240 

cgggtgcccg ggtggcgcag ttcgcctcgc ccggcttcga catgttcgtc gacgagtggt 24300 

cgatggcgct gctggccggc gccgcgctga ccttcgtgcc gccggagcgg cgactgggcg 24360 

ccgacctggc cgcgttcctc gccgagtacg gcgtgacgca cgcgacgctg ccgcccgcgg 24420 

tcgtgggcac gatcccggac ggtgtgctgc cgccgtcgtt cgtgctcgac gtcggcggcg 24480 

acgtgctgcc gggcgacctc gcgcgccggt ggctgcgcga cggccgggtg ctgttcaact 24540 

cgtacggccc gacggagacc acggtcaacg cggcgacgtg gcgggccgag gccggtgact 24600 

ggggaagcgt ggcgccgatc ggcacgcccg tgccgaatct gcgggcgtac gtgctcgacg 24660 

gctggctgcg cccggtgccg gtgggcgccg acggcgagct gtacgtctcg ggcgccggac 24720 

tcgcccgcgg ctatctgaac cgggccggcc tgaccgcgga gcggttcgtg gcgtgcccgt 24780 

tcgagccggg ggagcggatg taccgcaccg gcgacgtggt gcggtggacc gccgaggggc 24840 

gcctggtgtt cgccggtcgc tcggacgacc aggtcaagat ccgcggattc cggatcgagc 24900 

ccggcgaggt ggaggccgtg ctggccgccg gaccgggtgt gagccaggcc gcggtgatcg 24960 

tgcgtgagga cgtgccgggt gacaagcgct tggtggcgta cgtcgtgggc ggtgacgtcg 25020 

aagctctccg gtcgtacgcg cagcagcggt tgccgggtta catggtgccg tcggctttcg 25080 

tggagctgga ccggttgccc ttgacggtga acggcaagct cgaccgccgg gccctgccgg 25140 

tgcccgacct cgcccgcggc acgggatccg gccggcccgc cggcacgccg cgcgagcagc 25200 

tgttgtgcgc gggcttcgcg gccgtcctgg gcgtggacga cgtcggcgcc gacgacgact 25260 

tcttcgcgct cggcggtcat tcgctgttgg tggtttctct ggtggagtgg ctgcgccgtc 25320 

gtggggtctc ggtgccggtg cgggcgttgt tcaccacgcc gaccccggcc gggctggccg 25380 

aggcggtcgg tgatggtgcc gtggtggtgc cgccgaacct gattccggaa ggtgcggccg 25440 

agctgacccc ggagatggtg ccgctggccg atctgacgtc cgaggagctg gcgatcgtcg 25500 

tcgcatcggt gccgggcggt gcggccaatg tggccgatgt gtatccgttg gcgccgttgc 25560 

aggagggcat cttcttcccc gtagccacag gcccccagtg ctacgccacg gtggggagtt 25620 

cactcccgga cgatggcggt tctgcccctt gcagcaggtt tcgccgtcga tgtgtatcga 25680 
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cgagtgtggt gtggcagggg ctgcgtgagc cggtgcaggt ggtgtggcgg cacgcgcggc 25740 

tgcccgtcga ggaggtcgtg ctgcacgagg gggccgaccc ggtcgagcag atgatggcgc 25800 

tcgccggcgg ttggatggac ctcacccggg cgccgctcat cgacgtccac atcgccgccg 25860 

gccccggcgg cgaccgctgg ctggccgtgc tgcgcatcca ccacctcgtg caggaccaca 25920 

ccgccctgga gacgctgctc gacgagctgc agtcctttct ggagggccgc ggtggcgagc 25980 

ttgccgagcc ggtgccgttc cgcgagttcg tggcgcaggc gcggctcggt gtgccgcgcg 26040 

aggagcacga gcggtatttc gcggagttgc tcggcgacat caccgagacc accgcgccgt 26100 

acgacctgac cgacgtgcac ggcgacggca ccggatacga ccacggcgcg ctgccgctgg 26160 

acgccaccgt cgcggcccgc gtccgggagg cggcccgaac cctcggcgtc agcccggcga 26220 

cgctcttcca cctcgcgtgg gcgcgggtgc tcggcacgct ggccgggcgc gacgacgtcg 26280 

tcttcggcac cgtcctgttc ggacggatga actcgggtgc cggcgccgac cgggtctccg 26340 

gcctgttcat caacacgctg ccggtgcggg tgcggctcgg cgcgcccacc ggcgacgccc 26400 

tcggcgacct ccgcgaccag ctcgccgagc tgctcgtgca cgagcacgcc tccctcgcct 26460 

ccgcgcagaa ggcgagcggc ctgcccggcg ggagccccct gttcacgtcg atcttcaact 26520 

accggcacaa ccaggtgagc gccgaacggg agaccgccgc gctgcccggc atccgcgtcc 26580 

tcgcggcccg cgactccacg aactatccgc tcaccgtcgc ggtggacgac gacgggcacg 26640 

gcttcacgct cgtggtcgag gtcgcgtcca cagtcgacgc cgcgggcgtc tgcgaactgc 26700 

tgcacaccgc cgtggacaac ctcatcgcgg ccctcaccga ccggccgggc gggccgctgg 26760 

ccgaggtcga catcctcgaa cgcggtctgc gggaccgcct gctgaccgcc tggaacgagg 26820 

cccgggagcc cgcaccgccg gtgaccctgc ccgacctgtt cgaccggcag gcccgccgca 26880 

cgcccgaggc ggtcgcgctc accgcggacg gcgtctcgct gacctaccgc gagctgtcgg 26940 

agcgcgccaa ccggatcgcc cggctgctca cctcccgggg gatcggcccg gagtcgctcg 27000 

tcggtgtcgt cctgccccgc tcggccgacc tggtcgttgc gctgctcggc gttctccagg 27060 

ccggcgcggc ctacgtgccg gtcgacgccg actacccggc cgagcggatc gggtacatcc 27120 

tcggcgacgc gggcgcggtt tgcgtgctca cagtggacgc gaccgcgggc gccgttcctc 27180 

ccggcgtacc gaaactggtg ctggaccacc cggaaaccgt gaccgcgctg gcggcgtgtg 27240 

acacggcacc gctcggcgag gccgagcggg ccggcgaact gctgccggag caccccgcct 27300 

acgtcatcta cacctccggc tccaccggca cgcccaaggg cgtgctcatc ccgcaccgca 27360 

acgtggtcga gctcttcgcc gccacccgcg gctcgttcca cttcggcgag ggtgacgtgt 27420 

ggtcgtggtt ccactcggtc gccttcgact tctcggtctg ggagctgtgg ggcgcgctgc 27480 
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tgcacggcgg ccgcgtggtc atggtgccgt tcgccgtctc ccgctcaccg cgggatttct 27540 

gggaactgct cgtgcgcgag cgggtcaccg tgctcagcca gacgccgtcc gccttctacc 27600 

agctcgccgc ggccgccgac gacacgccgg acgcgctgcg cgtggtggtc ttcggcggcg 27660 

aggccctcga cccgggacgg ctcgccggat ggcgggaacg gcgcccggac ggaccgcgcc 27720 

tggtcaacat gtacggcatc accgagacga cggtccacgt cacccaccag gacctcgcac 27780 

cggccgacac caccggcagc cccatcggac gcggcatacc gggcctgtcc gtctacgtcc 27840 

tcgacgaggc gctgcggccg gtgccgcccg gggtcgccgg cgaggtgtac gtggccggtc 27900 

gccagctggc ccgggcctac ctcggccgcg ccgcgctgac cggcacccgc ttcgtcgcct 27960 

gccccttcct cccggccggg gaacggatgt accgcaccgg cgaccgtgcc cgctggagcc 28020 

gcggccggtt gcagttcgcc gggcgcaccg acgaccaggt gcagatccgc ggcttccgca 28080 

tcgaaccggg cgaggtgcag gccgtcgtcg ccgcccaccc tgagatcgcc gcggcggccg 28140 

tcgtcgtgcg cgaggacgtg cccggcgacc cgcgcctgac cgcctacgtg gtgccggccg 28200 

gcccgcgcac cgcgccggcg gccgtcgcgg aaaccgtgcg gcgcttcgcc gccgaccggc 28260 

tgccggccta catgcttccc tccgcggtcg tcgtgctgga cgccctgccg ctgaccgacc 28320 

acggcaagct cgaccggcgc gcccttcccg caccgcagca cacgggcgcc gcgagcggcc 28380 

gggcgccggc caccgtggcc gaggaggtgc tgtgcgccgc cttcgccgag gtgctgggcg 28440 

tcgagcgggt cggtgtcgac gacgacttct tcgccctcgg cggccactcg ctgctcatcg 28500 

tctcgctcgt cgagcgcgtc cgccgcgccg gcctggcgat cccggtccgc gccctgttcc 28560 

gcagcgccac cccggccggg ctggccgcgc tcgcccggcc gtaccgggtc gacatcccgc 28620 

ccaacctcgt cccggacggc gcccgcgaga tcaccccgga catgctgacc ctggccgcgc 28680 

tcaccgaggc cgagatcgcc acggtgctcg cgaccgtgcc cggcggggcg gtgaacgtgg 28740 

ccgacatcta tccgctggcc ccgctccagg agggcatctt cttccaccac ctcatggcgg 28800 

acgccggccg ggccgacgcc tacgcgatgc cgtacgtgct gcacctggac acggcggagc 28860 

ggctggacgt cctcctcggc gccctccagc gggtgatcga ccgtaacgac atctaccgca 28920 

ccggcgtggt ctcggccggc ctgcgcgaac cggtgcaggt ggtgtggcgg tcggccgtcc 28980 

tgcccgtcga ggaggtggcg ctggacggcg gccacgaccc ggtcgagcag ttgctcgccg 29040 

ccgccggcga ggagttcgac ctgacccggg cgccgctgat ccgggcgcac gtggcggcgc 29100 

atccggacgg cggccggctg ctcctgctgc gcatccacca cctcgtgcag gaccacacga 29160 

cgttcgacgt ggtgctgggc gagctgcggg ccttcctcga gggccgcggc ggcgagcttg 29220 

ccgagccggt gccgttccgc gagttcgtgg cgcaggcgcg gctcggtgtg ccgcgcgagg 29280 
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agcacgagcg gtatttcgcg gagttgctcg gcgacgtcac cgagaccacc gcgccgtacg 29340 

gcctgaccga cgtgcacggc gacggctccc gggccgtcca ggtctcgctg ccggtcgccg 29400 

aggccctcgc cgtccgcgtc cgcgaggtgg cccggacact cggcgtcagc ccggccaccg 29460 

tcttccacct ggcctgggcg cgcgtgctga gcgtcatcgc gggccgcgac gacgtggtgt 2 9520 

tcggaaccat cctcttcgga cggatgaact cgggcgccgc cgccgaacgc gtgcccggcc 29580 

tgttcatcaa cacgctgccg gtgcgggtgc gactgaacgg cacgagcgtg ggggaggcgc 29640 

tgaccgccct gcgcgaccag atggccgagc tgatggcgca cgagcacgcg ccgctcgcgc 2 9700 

tggcgcagcg ggccggcggc gtgcccgcgg gcagtccgct gttcacgtcg ctgttcaact 29760 

atcggcacaa cgtcgcgggc ggcggcgacg gcggagcgct cgagggcgtc acgccggtgc 29820 

tgcaccgcga caccacgaac tatcccgtgg tggtctcggt cgacgacgac ggcacgagct 29880 

tcgacctggt ggtggaggcg gtcgcgcccg cggaggcggg tcgcgtcggg cggctcatgc 29940 

acgaatgcct ggccgagctg gtgggcgccc tggccggtgc gccggagacg cccctgtccc 30000 

gcgtgcgggt gatcgacgag gccgagatcg aacgggtcgt tcacagctgg aacgacacgg 30060 

ctcgcccggt agtggagtcg tcggtgccgg cgttgttcgc cgagcaggtg gcggctgcgc 30120 

cggatgcgac ggcggtggtg ggcgagggtg tgtcgtggtc ttatcgcgag cttgatgcgc 30180 

gatcggatgc gctggcccgg agcctggtgg cggccggtgt gggtgtggag tcgccggtgg 30240 

tggtggcgtt ggagcggtct cccgaggtgc tgtccgcgtt cctggcggtg gcgaaggccg 30300 

ggggcgtgtt tgttccggtg gacctttcgt ggccgcaggc gcgtatcgat gcggtggttg 30360 

ctgactgtgc cgcgcgggtt gcggtggctg accggccgat gagcgggctg acggtcgtgc 30420 

cggccgacca ggtcggggat tcggctgtcg tcctgccggc cggtccggtg ccgggtgcgg 30480 

cggtctaccg gatgtacacc tccggctcca cgggccggcc caagggtgtg gtgaccaccc 30540 

accagaacct cgtggatttg gcgaccgaca cgtgttgggg tccgaccccg cgggtgttgt 30600 

tccatgcccc gcacgcgttt gatgcgtcgt cgtatgagat ctgggtgccg ttgttgaatg 30660 

gcggcacggt cgtggtcgct ccgcagcgca gcatcgacgc gacggtcttg agggacctga 30720 

tccgcgggca tgagttgacg cacgtgcatg tgaccgccgg tctgttgcgg gtgctggacc 30780 

cgtcgtgttt tgcggggctg accgaggttt tgaccggcgg ggatgcggtg tcggcggagg 30840 

cggtgcgccg ggtccgggaa gcgaacccgg gtctgcgggt gcggcagctt tatggcccga 30900 

ccgaggtgac cttgtgcgcg acgcagcatc tgctggtcga cggggtgccg atcgggcggc 30960 

cgttggacaa cacccgcgtg tatgtcctgg atgacttgct gcagccggtc ccggtcggtg 31020 

tgactggcga gctgtatgtg gccggggccg gcttggcgcg tggctatgcg ggcatgcccg 31080 
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ggttgacggc cgagcggttc gtcgccgacc cgttctcggt tggtggtcgc ctctaccgca 31140 

cgggtgatct ggtccgctgg actgacgacg gggtgctgca tttcgccggg cgggccgatg 31200 

atcaggtgaa gattcgtggc tatcgggtgg agccgggcga ggttgaagcg gtcctggctc 31260 

aacaccccga cgtcagccag gtggcagtgg tcgtccgcga ggacacgcca ggggataagc 31320 

ggctggtcgc ctatgtcgtc ggcggtgacg tcgaggcgta tgcgcaggag cgccttccgg 31380 

gctacctggt cccgtcggca ttcgtacatc tggatgcgct gccgctgacc agcaaccaga 31440 

aggtcgaccg ggccgcactg cccgcgccgt ccgtggagtc cggcgtcggg cgggcgcccg 31500 

ccgacgcgcg tgaagagctg atgtgtgccg cgttcgctga ggtgctcgac ctggatcggg 31560 

tcggtgtcga cgacgacttc ttcgctttgg gtgggcattc gctgttggtg gtgcggttgg 31620 

tgggccgtat tcggcaggtg ttcggggtgg aggtgtcggc tcggctggtc ttcgatgcgc 31680 

ggactccggc cggtgtggtg gcccgcttgt ccgagggcgg cacggcccgg gaggccgtac 31740 

gggcgcgggt gcgtccggcg cgggtgccgt tgtcgttcgc gcaacgccgg ttgtggttcc 31800 

tgtcccagct ggagggtccg agcgcgacct acaacatccc ggtggcgctg cggctggacg 31860 

gtcctctgga tcgcgatgct ctaacggcgg cgttgcacga tgtggtggcc cggcacgagg 3192 0 

tgttgcgtac cgtcttcacc gtcgccgacg gcgagccgtg gcaacagatc ctcgacgatc 31980 

cgcaggtctc cgttccggtc gtcgaggtca cgcccgaccg gctgcccgag gcggtggccg 32040 

tcgccgcggg gcaccggttc gacctcggcc gggaactgcc gctgcgggcg gtcctgctgg 32100 

cgaccggcga cgacgtgcac gtgctggtgc tcgtggtgca tcacattgcc gccgacggct 32160 

ggtcgatgcg gccgctcgcc cgggacttgg cggccgccta cgcggccagg atcgacgcga 3222 0 

cggcgccggc cctcggcgcg ctgccggtgc agtacgccga ctacgccctc tggcagcgcg 32280 

acgtgctcgg ttccgagcac gacccggaca gcgtcatctc ccaacaggtt gcctattggc 32340 

ggcggcagct ggccggcgta ccggaggaat tggatctgcc ggtggaccgg gcgcgcccgg 32400 

ccgaggcatc gcatcgcggc cacaccgtgg agttcgccgt gcccccggcc gtgcaccacc 32460 

aactcgccga actcgcccgc cgcaacggcg tcaccgtctt catgaccgtg caaaccgccc 3252 0 

tcgccgtcct cctgtccaaa ctcggcgccg gcaccgacat ccccatcggc gtcgccgtcg 32580 

ccggacgcac cgaccccacc ctcgacaacc tcatcggctt cttcgtcaac accctcgtcc 32640 

tacgcaccga cctgaccggc aaccccacca tcaccgacct gctgcaccgc acccgcgaca 32700 

ccaccctgca cgccttcacc caccaagacg tccccttcga aaaactcgtc gaagacctcg 32760 

cacccacccg ctccctcgcc cgccaccccc tcttccaggt catgatgacc ctgcagagcg 32820 

cgtcggcgga cgaagagccg ctggcgctcg ccgggctgcg ggtcaccgac ctcccggccg 32880 

22 



gggagacacc cgccaaggtg gacctcgacc tgacgctgca cgaggtggcg ggccgagacg 32940 

gcatgcacgc cacgctcctc ggcgcggccg acctcttcga gcaggagacg gtgcgcgccc 33000 

tcgccgaccg gttgctgcga accctggaag ccatggcggc ggccccggac gaccgcctcg 33060 

accggatcga ggtgctgtcg ccgggggagc ggtcgcggct gctggtggag tggaacgaca 33120 

cggctcgtcc ggtggtggag tcgtcggtgc cggcgttgtt cgccgagcag gtggcggctg 33180 

cgccggacgc ggtcgcggtg gtgggcgagg gtgtgtcgtg gacctatcgc gagcttgatg 33240 

cgcggtcgga tgcgttggcg cggagcctgg tggcggcggg tgtgggtgtg gagtcgccgg 33300 

tggtggtggc gttggagcgg tctccggagg tgctgtccgc gtttttggcg gtggcgaagg 33360 

ccgggggcgt gtttgttccg gtggatttgt cgtggccgca ggcgcgtgtc gatgcggtgg 33420 

ttgcggactg cgg ; cgcgcgg attgcggtgg ctgaccggcc gatgagcggg ctgacggtcg 33480 

tgtccgccgg cctgggcggg gattcggccg tcgtgtccgg cgacctgacc gcggatcggg 33540 

ccgttgtgtt gccggccggt ccggtgccgg gtgcggcggt ctaccgcatg tacacctcgg 33600 

gttccacggg tcggcccaag ggtgtggtga ccacccacca gaacctcgtg gatttggcga 33660 

ccgacacgtg ttggggtccg accccgcggg tgcttttcca cgccccgcac gccttcgacg 33720 

cgtcgtcgta tgagatttgg gtgccgttgt tgaatggcgg cacggtcgtg gtggctccgc 33780 

ggcgcagcat cgacgcgacg gtcttgaggg acctgatcgg cgcgcatgag ttgacgcacg 33840 

tgcatgtgac cgcgggcttg ttgcgggtgc tggacccgtc gtgcttcgcg ggcctgaccg 33900 

aggtgctcac gggcggggat gcggtgtcgg ccgaggcggt gcgccgcgtc aaagacgcga 33960 

atccgggtct gcgggtgcgg cagctgtacg gcccgaccga ggtgaccttg tgcgccacgc 34020 

agcatctgct ggatgacggg gtgccgatcg ggcggccgtt ggacaacacc cgcgtctacg 34080 

ttctcgacga cctcctgcgg ccggtcccga caggtgtggt gggggagctg tatgtggccg 34140 

ggtcgggtct ggcgcggggc tatgcgggca tgcccgggtt gacggccgag cgattcgtcg 34200 

ccgacccatt caacaccggc ggtcgcctct accgcacggg tgatctggtg cggtgggccg 34260 

acgatggtgt gctgcatttc gctgggcggg ccgatgatca ggtgaagatt cgtggctatc 34320 

gggtggagcc gggcgaggtt gaagcggtcc tggctcaaca ccccgacgtc agccaggtag 34380 

cagtggtcgt ccgcgaggac accccaggcg acaagcgcct ggtcgcctac gtcgtcggcg 34440 

gggatgtcga ggcgtatgcg caggagcgcc ttccgggcta catggttccg tcggctttcg 34500 

tgcagttgga tgcgctgccg ctgaccagca accagaaggt cgaccgggcc gctctcccgg 34560 

cgccatccat ggagtctggc gccggccggg cacccgccga cgcgcgtgaa gagctgatgt 34620 

gtgccgcgtt cgccgaggtg ctcgacctgg atcgggtcgg tgtcgacgac gacttcttcg 34680 
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ccctcggcgg gcattcgctg ctcgccgtct ccctcgtgga gaatctgcgc cgccacggcg 34740 

ttcacatctc cgttcgggcc ctcttcgcca cccccacgcc ggccgcgctg gccgcctcgg 34800 

cgggaaccgc cgtcccggac gtgccgccca acctcatccc ccagggcggc gcccaggaac 34860 

tgacccccga catgctgccc ctggtcgacc tgaccggcga ggaactggcc accatcgtgg 34920 

ccgcggtgcc cggcggcgct cccaacatcg ccgacatcta ccccctagcc ccgctgcaag 34980 

agggcatctt cttccaccac ctcatgaccg agggcgatgc caccgacgtc tacctcctgc 35040 

cgcggattct cggcttcggc ggccgtcccg agctggacgc cttcctcggg gccctgcagc 35100 

aggtggtgga ccgccacgac gtctatcgca cggccatcgc ctggcagaac ctgcgcgagc 35160 

ccgtccaggt cgtgcaccgc cacgccaccc tgcccgtcac cgaagtcacc cccgaccagc 35220 

tgcacgccgc cgccaccggc ggccggctcc cgctcgacca cgcacccctg ctcagcgtcc 35280 

acatcgcacc cgaacccgac ggcggctggc tcgccctgct ccgcatgcac cacctcgtgc 35340 

aggaccacac cgccctcgac atcgtcctcg acgagatccg caccatcctc gccggcgcaa 35400 

ccgaccacct ccccccgccc gtaccgttcc gcgacttcgt ggcgcaggcc cgcctcggcg 35460 

tctcccgcgc ggagcaggag cgctacttcg ccggcctgct cggcgacgtc accgagacca 35520 

ccgccccgta cggcctggcc gacgtgacga acgacggcac cgcatcggtg cgggccgagg 35580 

tcgagctcga cgcggccctg gcggcccggc tgcgcgacct cgcccgcgac cggggcgtca 35640 

gcccggcgac ggtcttccat ctggcctggg cgcgcgtgct ggcggcggtg gccgaccggg 35700 

aggacgtcgt cttcggcacc gtcctgttcg gacggatggc ctccggcgcc cggcgggtgc 35760 

ccggcctctt catgaacacc ctgccggttc gcgtacgcct gtccggaacc gccgccgagg 35820 

ctctgggaca ggtgcgcgac cggctcgccg agctgatggc gcacgagcac gcgccgctcg 35880 

cgctggccca gcaggcgagc ggcctgcccg ccgggagccc gctgttcacg tcgttgttca 35940 

actaccgcta tgcccggccg ccggccgcca cgccggacga tccgctggcg ggggtgcgca 36000 

cgctgttcgc gtgggaacgc aacaactacc cggtcaccgt gtcgatcgac gacgacggca 36060 

ccggattcgc ggtcacggtc gacgtcgtgg cgccggccga cgccgacgag gtcgtccgcc 3 6120 

tgctccgcac gaccctgacc cgcctggccg ccgccctcga acgcactccc gagatgccgg 3 6180 

tggccgacgt gcggcccggc cgcgtgtccc ggcccgccgc cggccgcgcg gtgctcgtgc 3 6240 

cggtcccggc cggcgagcgg gcgaccggcg cgggccgggc tccggccacg gcgtacgagg 36300 

agctgatctg ccaggcgtac gcccaggtgc tggaggttga ccgggtggcg gccgacgacg 36360 

acttcttcgc cctgggcggc aactcgctgc tcgccacgcg gctggtcagc cggatccgct 36420 

cggcgctggg cgtggaggtc accatccgcg cgctcttcga gacgctcacc ccgcaacggc 36480 
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tggccgcccg 
ggccggagcg 
agggcagcag 
cggccgcgct 
tgatcccggc 
cggcggtgga 
ccggttacgc 
gcgcgaccga 
tggcgccgct 
cgcgctggga 
tgggcgcggc 
ccctggccgg 
cctcgcaccg 
ccgagctggc 
cgctgctgag 
gcaccgacga 
ccgacgtgtc 
tcgccgcgta 
agcgttccct 
gcgcgcggcc 
cccgctacga 
gcctggaggg 
tcggcgaccg 
gccggatcga 
cccggcgcga 
gcccgggtgc 
cgcgcgccgg 
tggccgtgct 
ccggcggcac 
tccgggacgc 



gctgacccgc 
gattccgctg 
cgccacctac 
caccgcggcg 
cgaggacggc 
gatcgtcgag 
gttcgacctc 
ccacgtcctc 
cgcccgcgac 
gccgctgccg 
cggtgacccg 
gatgccgccg 
cggcggcgag 
cgtcgccgag 
ccggctgggt 
cgccctcgac 
cggcgacccc 
cgcccatcag 
ggcccggcat 
gcggctggcc 
cctcacgctc 
gcatctcgag 
gctgatccgg 
tctgatggcc 
cgtgccggcg 
ggtcgcgctg 
gcggctggcc 
gttgccgcgc 
ctatctgccg 
ggagccggcg 



gcctcggcac 
tccttcgcgc 
agcaacacga 
ctgcacgacg 
cggccgtacc 
gtcgctcccg 
gccgccgaga 
gtcctggtga 
ctcgccgccg 
ctgcagtacg 
gagagcctgc 
gagatcccgc 
gtgccgatcg 
cgggccaccc 
gccggcaccg 
gagctggtcg 
gggttcggca 
gacgttccct 
ccgctgttcc 
ggcctcgaag 
accgtgaccg 
ttcagccgcg 
ctcctcaccg 
cccgcggagc 
gccaccgtgc 
cggtcggagg 
gcggtgctgc 
ggcgtcgagc 
atcgatcccg 
tgcctgctga 



ccgggcgggt 
aacggcgcct 
ccgcgctgcg 
tgatcggccg 
agctggttct 
gcgagcttgg 
taccggtccg 
tccaccacat 
cctacgaggc 
ccgactacgc 
gcgagcgcca 
ttccggccga 
ccatacccgc 
tcttcatggt 
acgtcccgat 
gcttcttcgt 
cgctgctgcg 
tcgaccaggt 
aggtggcgct 
tcggcaccga 
agcggcgcga 
acctgttcga 
ccgcggtggc 
gccgcaacgt 
cggagctggt 
acggcgagat 
gccgccgcgg 
aggtggtcgc 
cgtacccgcg 
ccgtcgccgg 
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cgcacccgcg 
gtggttcctg 
gctctccggc 
gcacgaggtg 
cccgcccgag 
cgcggccgtc 
cgcccggctg 
cgccaccgac 
ccggctggcc 
gctctggcag 
gctcgcctac 
ccgttcccgc 
cgacctgcac 
tctgcaggcc 
cggcaccgcc 
caacatgctt 
ccgggtgcgc 
cgtcgaggag 
gaccgtgcag 
gccgatcgag 
cgagcacggc 
cgcgccgacc 
cgacccggag 
gctcgagggc 
ggccgcgcag 
cacgtacgcc 
gatcgggccg 
cttcctcgcc 
cgaccgcgtc 
gcatcgcgcg 



ccgaggacac 
ggcgagctgg 
gagctcgacc 
ctgcgcacgg 
gaggcacggc 
gacgaggtgg 
atccggctgg 
ggatggtcga 
ggccgggcac 
gaggagttgc 
tggcgcgaca 
ccgccggtgg 
cgccgcctgg 
ggcttcgcgg 
ctcgccggcc 
gtgctgcgca 
gagaccggcc 
ctggtgaccg 
aacgcgccgg 
cacggcatcg 
gcgccggacg 
gtcgcgaccc 
ctgccgctga 
tggagcaccg 
gtggcccggc 
gagctggacg 
gagtcgcggg 
gtggtgcggg 
gactacctcg 
gcggcgccgg 



36540- 
36600 
36660 
36720 
36780 
36840 
36900 
36960 
37020 
37080 
37140 
37200 
37260 
37320 
37380 
37440 
37500 
37560 
37620 
37680 
37740 
37800 
37860 
37920 
37980 
38040 
38100 
38160 
38220 
38280 



cggcaccggc ggtggtcgag ctggacgacc cggcgaccgc ggccgagatc gccgatgccg 38340 

aaccggagcc gccggtcgcg gtgcggccca cccactccgc ctacctgatc tacacctcgg 38400 

gctcgaccgg gcggcccaag ggcgtcgtgg tcacccaccg gggtgtggcc gcgctcgtgg 38460 

ccacccaggc cgagcggctc gcggtgaccg gcgagagccg cgtcctgcag ttcgccagcg 38520 

tgggcttcga cgcctcgatc tgggagatgg tgatggccct gtgcgccggc gccaccctgg 38580 

tcgtcgcccc ggccgacgac ctgctgcccg gcccggccct ggccgccacg ctgtccgggc 38640 

acgcggtgac ccacgcgacc ctgccgccgg ccgtgctcgc cgcgtccgcg cccggcgatc 38700 

tcgcgccgct cgccgtgctg gtctcggccg gtgaggcgct cgggccggac ctcgtccggc 38760 

agttcgcgcc cggccgcgcg ctggtgaacg cgtacgggcc gaccgagacc acggtgtgcg 38820 

ccacggcgtc cgccccgctc ggcccggagg atcccccgca catcggcgcc ccggtcgccg 38880 

attcccgggt ctacgtgctc gacgacgcgc tcaccccggt gccgcccggg gtcaccggcg 38940 

agctgtacgt ctcgggggcg tcgctggccc gcggatatgc cggtcgcgcg gccctcaccg 39000 

cggaacgctt cgtggcctgc ccgttcgcgc cgggtgagcg gatgtaccgg accggggacc 39060 

gggcccgctg ggacgcggcc ggccggctca cgttcgccgg gcgcgccgac gaccaggtca 39120 

agatccgcgg gttccgggtg gaaccgggtg aggtggccgc ggtgctcggc gaacacccgg 39180 

cggtcgcccg ggcggcggtc gtggcgcgca cggacgggcc gcagggcgcg cggctggtgg 39240 

cgtacctcgt cgccgccgac ccggccgggc ccgacctggc cgctgcggtg cgcgcgtacg 39300 

ccgccgcgac cctgcccgcg cacctgcttc cggccgcgtt cgtgccgctc gaccgcctgc 39360 

ccctgaccac gaacggcaag ctggaccggg ccgcgctgcc cgaaccggag accggcgccg 39420 

ggcgcgagcc gtccggcccc gtcgaacggc tgctctgcga ggcgttcgcc gacgtgctcg 39480 

gcctcgaccg ggtcggcgcc gacggccact tcttcgacct gggcggccat tcgctgctcg 39540 

ccacccggct gctcagccgg ctccgctcgg ccgcgggcat cgacgtcccg gtccgggtgc 39600 

ttttcgagaa ccccactccc gccgggctcg ccgcctgggt ggagacccac gccggatccc 39660 

gccggaagtc ccggccggcg ctgcggccga tgcgtcacca gaaggagtcc tgatgatccc 39720 

cctgtcgttc gcgcagcgcc gcctgtggtt cctcggccgg ctcgaggggc cctccgccac 39780 

ctacaacatc ccgctcgtgc tgggcctgac cggcaccgtc gacgccgccg ccctcgaaac 39840 

cgccctgcgc gacgtgctgg agcggcacga ggtgctgcgt accgtctatc cggacgccgg 39900 

cggcgagccg caccagcgga tcctgccgct cggcgagacc ggcttcggcc tgcgggtcgc 39960 

cgaggtgacg gacggcgagc tggacgcggc cgtcgcggac gccaccgggc acgccttcga 40020 

cctcgcgacc gagatcccgg tccgggcctc gctgctcacc gtcgagccgg gccggcacgt 40080 
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cctggcgctg gtgctgcacc acatcgcggc cgacggctgg tcgatggggc cgctgctgcg 40140 

cgacctgtcc accgcgtaca cggcccggct ggccggcggg gaaccggcct ggtcgccgct 40200 

gccggtccag tacgcggact acgcgctgtg gcagcaggag gtgctcggcg ccggtgacga 40260 

cccggagagc ctcctgcgcg agcaggtcgg ctattggcgg tcggcgctcg ccggagcccc 40320 

cgaggagctg cgcctgccgg ccgaccaccg gcgcccgccc gtgtcgtcgt cccgggcgca 40380 

catggccgag ttcgccgtgc cggccgccgc ccacggcgac ctgaccgccc tcacccgcga 40440 

gctcggcgcc acgctcttca tggccgtgca cgcggcgacc gccatggtgc tttccgggct 40500 

gggcgcgggc gacgacctgc cgatcggcac ggtggtggcc ggccgcaccg acgccggcct 40560 

cgacgacctc gtcggctgct tcgtcaacaa cctggtgatc cgggccgacc tgaccggcga 40620 

cccgaccttc gcggacctgc tgcggcaggt ccgcgagcgg gccctcgacg cgtacggcca 40680 

ccaggacgtg ccgttcgaga agctcgtcga ggagctcgcg ccgtcgcggt cgctgagccg 40740 

ccacccgctg ttccaggtgg ccgtcgccgt ggagaccgac gacctgatcg gcggtcgcgg 40800 

cggcggtccc gccctgcggc tgcccggcct cggcatcgag gtgctgcccg gcgagccctc 40860 

cgctcgcgac ctcgacctcg acctggtggt gcgcgagacg ttcgacgccg agggacgccc 40920 

cgccggtctc accggggcac tgatcggcgc ggccggcctg ttcgacgccg cgtcggtgga 40980 

gcggctggcc gcgctgctgg cccgcgcgct cgaggcgctg gccgccgacc cgcgcacgcg 41040 

cgccggcgac ctcgacctgc tctccccggc ggaccgccgg ctgatcctgc gcggctggaa 41100 

cgacaccgcg gctccggcgc cggccggact ggtgccggac ctgttcgccg cccaggccgc 41160 

gcgcaccccg gacgcggtcg cggtcgccgg gcccgaccgg gagctgacct atgccgagct 4122 0 

ggacgagcgc tccggccgcc tcgcgcgctg gctgatccgg cgcggggtcg ccgccgacac 41280 

ccgggtcgcg ctggtgctgg agcgctccgc ggagctgccg gtggcgatcc tcgccgtgct 41340 

caaggccggc ggcgcgtatc tgccgatcga tccggcgcag ccgccgcgcc gcatcgccga 41400 

catcgtggcc gacgccgccc cggcgctcgt gctggcccag gcgtccaccg ccgacgtcgt 41460 

ggccgacgcc tctccggcgc tcgtgctggc cccggcgtcc gacggtgtgc ccaccggcgc 41520 

cgtgcccgtg cacctgctcg actcgcccgc cgtgcgtgac gaggtcgcgc agtgcccggc 41580 

cggggccgtg accgacgccg accggcgggg cgtcctgctc ggcggtcacg cggcctacgt 41640 

catctacacc tcgggatcga ccggacgccc caagggtgtc gtcgtttcgc acgacgcgtt 41700 

cgcgaacctc gtcctggacc agcgccggct cggcatcggg ccgggcagcc gggtggcgca 41760 

gttcgcctcg ccgggcttcg acatgttcgt cgacgagtgg tcgatggcgc tgctcgccgg 41820 

cgccgccctc gtgatcgtgc cgccggagcg ccggctcggc gcggacctcg ccgcgttcct 41880 
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caccgagcgc ggggtcaccc acgccacgct gccgccggcg gtggtggcga cgctgccgga 41940 

ggagtcgctg ccacgctcgt tcgtgctgga catcggcggc gacgcgctgc cggacgacct 42000 

ggcccgccgg tggctgcgcg acggccggtg gctgggcaac tcgtacggcc cgacggagac 42060 

cacggtcaac gcggcgacgt ggcgctgcga gcccggcacc tgggagggcg cgaccccgat 42120 

cggccggccg gtcgccaacc tgcgggcgta cgtgctcgac ggccgcctgc ggccggtgcc 42180 

ggtgggcgtg gagggcgagc tgtacgtctc gggcgccggc ctcgcccgcg gctatctgaa 42240 

ccgggccggc ctgaccgccg gcagcttcgt ggcctgcccg ttcgagccgg gggagcggat 42300 

gtaccgcacc ggcgacatcg tgcgctggga cgcgcggggc cgcctcgtct acgccggccg 42360 

cgccgacgac caggcgaaga tccgcggttt ccgggtcgag ccgggcgagg tggaggccgt 42420 

gctggccgcc ggtccgggcg tgaaccaggt cgcggtgatc gtgcgtgagg acgtgccggg 42480 

tgacaagcgc ttggtggcgt acgtcgtggg cggcgacgtg gagaccctcc ggtcgtacgc 42540 

gcagcagcgg ttgcccggat acctcgtgcc gtccgcgatc gtcgcgctgg ccgagctgcc 42600 

gctgacaccg agcgccaagg tggaccggcg ggctctgccg gtgccggact acggccggga 42660 

cgccggtggt gggcgggcgc cggccaacgc tcgcgaggaa gtgttgtgcc gggcgttcgc 42720 

cgaggtgctc ggcgtcgagc gggtgggtgt ggaggacgat ttcttcgcgc tgggtggtca 42780 

ttcgctgctg gtggtctcgc tggtggagcg gctgcgccgg caggggatct cggtgccggt 42840 

gcgggcgttg ttcaccacgc cgaccccggc cgggctggcc gaggcggtcg gtgatggtgc 42900 

cgtggtggtg ccgccgaacc tgattcccga gggtgcggcg gagctgaccc cggagatgct 42960 

gccgctggcc gatctgaccg ccgacgaact tgctgttgtc gtggattcgg tgcctggtgg 43020 

tgcggcgaac atcgcggatg tgtatccgtt ggcgccgttg caggagggca ttttcttcca 43080 

tcacatgatg gccgaccggg attcggcgga cgtgtatgtg acgccgacgg tggtggagtt 43140 

cgactcccgg gaccggttgg acggcttcct ggccgccttg cagcaggtcg tcgaccgtac 43200 

ggatgtgtat cggacgagtg tggtgtggca ggggctgcgc gagccggtgc aggtggtgtg 432 60 

gcggcacgcg cgcctgcccg tcgacgaggt ggtgctgcgg gacgacctcg acccggtcga 43320 

gcagctgaac gcgctcggca cggcctggat ggacctgtcc gaggcgccgc tggtgcaggc 43380 

cgtcgtcgcc gcccgccccg gcgatccgca gcgctggctc gccgtgctgc gcatccacca 43440 

cctcgtgcag gaccacaccg ccctcgacat cctcctcgag gagctggcgg cgtacctggc 43500 

cggccgcggc ggcgacctgc ccgagccggt gccgttccgc gagttcgtcg cgcacacccg 43560 

cctcggcgtg ccccgcgagg agcacgagcg ctacttcgcc gggttgctcg gcgacgtcac 43620 

cgagaccacc gcgccgtacg ggctcctcga cgtgcacagc ggcggtctcg cctcggcgca 43680 
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ggcccacctg cggctggacg gcccgctcgg ccggcgcgtg gccgccttcg cccgggaaca 43740 

cggcgtcagc ccggcgacgc tcttccacct cgcgtgggcg cgggtgctcg gcacgctggc 43800 

cgggcgtgac gacgtcgtct tcggcacggt cctgttcggg cggatgaact cgggcgccgg 43860 

cgccgaccgc gttcccggcc tgttcatcaa cacgctgccg gtgcgggtgc ggctcggcgc 43920 

gcccgtcggc gacgccctcg acggcctgcg tgaccagctc atcgagctca tcgcccacga 43980 

gcacgcgccg ctggccgtgg cccagcaggc cgcgaacctc ttcggccggc cgctcttcac 44040 

ctccatcttc aactatcggt acgcccgggg ggccgagccg gccggcgccg cgctcgacgg 44100 

catccgcctg ctctccgccc gcgacctcac caactatccg ctggcggtgg ccgtcgacgc 44160 

ggagggcgac acgttctcgc tcaccgtcga cgcggtggcg ccggccgacc ccgtgcaggt 44220 

cggcgagctg ctcgtcaccg cgctgcgcaa cctgacccgg accgccgaga acgcgcccgg 44280 

aacgccgctg gccgcggtcg gcgtgctggg cgaggacgag ctgagccggg tcgtctccgg 44340 

ctggaacgac accgcccgcc gggtccggca ggcgtcggtg cccgagctct tcgcggagcg 44400 

ggtggcggcc gcgcccggcg cgccggccgt cgccgccggg gacctgcgct ggacgtacgc 44460 

ggacctcgac gcccgttccg acgcgctcgc gcggagcctg gtggcggccg gggtgaccgc 44520 

ggagtcgccg gtcgtcgtcg ccctcgagcg ctccgcggac gtgctgaccg cgttcctcgc 44580 

cgtcgcgaag gccggcggtg tcttcgtccc ggtggacctc tcctggcccc gggcccgcgt 44640 

cgacgcggtg atcgccgact gcgccgcctg gatcgcggtg gccgaccggc cgatgaccgg 44700 

cctgaccgtc gtgcccgcca accgggccgg cgatcccgcg gtcgcgctgc cgccccgccc 44760 

cctgccgggc gcggcggcct accggatgta cacctccggc tccacgggcc ggcccaaggg 44820 

cgtggtgacg acccatcaga acgtcgtcga cctggtcacc gaccggtgct ggggcccgac 44880 

gccgcgggtc ctgttccacg ccccgcacgc cttcgacgcc tcctcgttcg agctctgggt 44940 

gccgctgctg accggcggca cggtcgtggt cgcaccgggg gagagcatcg acaccggtgt 45000 

gctgcggcag ctgatccggg cccacgagct gacccacgtg cacgtcaccg cgggcctgtt 45060 

gcgcgtgctg gccgaggacc cgtcgtgctt cgccgggctc accgaggtgc tcaccggcgg 45120 

cgacgtggtc ccggccgagg cggtgcgccg cgtgctggac gccaatcccg gcgtgcgggt 45180 

gcggcagctg tacggcccga ccgaggtgac gctctgcgcc acccagcacg tggtgcgcga 45240 

gcccagcccg gtgctgccca tcgggcggcc gctcgacaac acccgcgtct acgtgctcga 45300 

cgggcttctc cagccggtcc cggtcggtgt caccggcgag ctgtacatcg ccggcgccgg 453 60 

cgtggcccgg ggctacgccg acatgccggg caccaccgcc gagcggttcg tcgccgaccc 45420 

gttcaccgcc ggcggccgcc tctaccgcac cggtgacctg gtccgctgga ccggcgaggg 45480 
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cgagctggtg ttcgccggcc gggccgacga ccaggtgaag atccgcggct accgcgtcga 45540 

gccgggcgag gtggaggcgg tcctcgccgc gttgccgggc gtcagccagg cggcggtcat 45600 

cgtccgcgag gacgtacccg gcgacaagcg gctggtggcc tacctggtcg cggcgccgga 45660 

gacggtcgag gccgcccgcg cccacgccga gcagcggctt ccgtcctatc tcgtcccgtc 45720 

cgcgttcgtg cagctggacg cgctgccgct gaccggcaac cagaaggtcg accgggcggc 45780 

gctgccggca ccgctggggt tcgaagccgg tgccggccgg gcgccggcgg acgcccgcga 45840 

ggagctggtc ggcgccgcct tcgccgaggt gctcgacctc ggccgggtgg gccccgacga 45900 

cgacttcttc gcgctcggcg ggcactcgct gctcgccctc gcgctggtgg agcgcctgcg 45960 

ccggcagggc ctgggcgtct cggtgcgtgc cgtcttcgac gcacgcaccc ccgcggcgct 46020 

gacccgccgc ggcgacggcg gtgccgacga ccggccggcg ctgcgggccg gtgcgcggcc 46080 

cgcgcggctg ccgctttcct acgcgcagcg ccggctgtgg ttcctggccc agctggaagg 46140 

accgagcgcc acctacaaca tcccggtcgc gctgcgcctg gagggcgacc tcgaccggga 46200 

tgccctgacc gccgccctgc gcgacgtggt ggcccggcac gaggtgctgc gcacggtgtt 46260 

cacggtcgcc gacggcgagc cgtggcaaca catcctcgac cccgcgcggg ccgagcccgc 46320 

gttgccggtc gtggacgtgc cggccggccg ggtcgaggag gcggtcgccg aagcggccgc 46380 

gtacgccttc gacctggccc gggagatccc gctgcgtgcc gtgctgctcg cccccggcga 46440 

cggcacccac gtgctcgtgc tggtgctgca ccacatcgcg gccgacggct ggtcgatgcg 46500 

gccgctggcc cgcgacctgg cgaccgccta cgccgcgcgg cggcgggggc aggcgcccga 46560 

gtcggagacc ctgcccgtcc agtacgccga ctacgccctc tggcagcgtg acctgctggg 46620 

ctccgacagc gacccggcga gcctgatctc ccggcagatc gcccactggc gcgagcggct 46680 

cgacggcgtg ccggaggagc tggacctgcc cgccgaccgg ccgcggcccg ccgcggcctc 46740 

gcaccgcggc cacctgcaca gcgcggagat cccggccgac gtgcaccgga gcctgcgccg 46800 

ggtcgccgcc gaccacggcg cgaccgtctt catgaccctg caggccgccg tggcggtcct 46860 

gctgtcgcgg ctcggcgcgg gcaccgacgt cccgatcggc accgtcgtcg ccggccgcgc 46920 

cgaccgggcg ctggagaacc tggtcggctt cttcgtcaac acgctcgtgc tgcgcaccga 46980 

cctgaccggc gacccgcggc tgaccgacgt gctcggccag gtgcgcgagc tgaccctgcg 47 040 

ggcgctggcc caccaggacg tcccgttcga gaagctggtc gaggagctca ccccggcccg 47100 

ctcgctcgcc cggcaccccc tgttccaggt catggtcacc ctggacggcg gcgggccgga 47160 

cggcgccgag ctgccgggcc tggcgatgtc ggtcgtgccg accggcgccg ttccggccaa 47220 

gttcgacctc gacctcacgt tcaccgagac cttcgacgcc gcgggggagc cggccggcct 47280 
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gcgcgtcgac ctcatcgcgg cggccgacct cttcgacgcg ggcacggccg cccggctcgc 47340 

cggctacctg agccgcgttc tcggcgtgct cgcggccgat ccgcggcgcc gcctggccga 47400 

ggtcgacccg ctggaggcgg aggagagccg gctcatgctc gccgccggtg aggagcccgc 47460 

gccggccctg cccgagatca ccgtcgcggc gctcgtcgcc gagcagtgcg cccgcacgcc 47520 

gggtgcggtc gcggtgaccg gaccggacgc gagcctgacc tacgccgagc tcgacgagcg 47580 

ggcggcccgg atcgcccgct ggctgcgccg ccacggtgcc gggcccggcg cggccgtgtg 47640 

cgtcctgatg gaacggtcgg cggagctggt cgccgtgctg ctcggcgtga tgcgcgcggg 47700 

tgcggcgtac gtgccggtcg accccgccta tccggccgag cggatccggt tcgtcgtcac 47760 

cgacgcccgg gccgcctgcg tggtgagcga gtcggcctcg gccggcctcg tcccggacgg 47820 

ggtgccgtgc ctggcgatcg acgacccggc cgccgccgcg gaaccggccg agcccggcga 47880 

cgacccgggc gacgcggccg ggccgcggcc ggacgatccg gcgtacatca tctacacctc 47940 

cggatcgacc ggcaccccca agggcgtcgt ggtctcgcac cgcaacgtcg tggcgctgct " 48000 

gaccgccacc cggccgctgt tcggcttcgc cggcgacgag gtgtggtcgt ggttccactc 48060 

ggtcgcgttc gacttctcgg tgtgggagct gtggggcgcg ctcacccacg gcggccgggt 48120 

cgtcgtcgtg ccctacgcgg tgtcccgctc gccgcgcgac ttctgggagc tcgtcgtccg 48180 

cgagggcgtc accgtgctga gccagacacc gtcggccttc gcgcagctca tggccgcggc 48240 

gggggacgac gaccgggacg cgctgcggtt cgtcgtcttc ggcggcgagg ccctcgaccc 48300 

gggccggctg gccggctggc tggcccgccg cccggacaag ccgcgcctgg tcaacatgta 48360 

cggcatcacc gagacgaccg tgcacaccac gtaccagcac atcgcgcccg gcacgacggg 48420 

cagcgtcatc ggccgcggac tgcccggctt cggcctctac gtgctggacg aggcgctgcg 48480 

cccggtgccg gccggcgtgc ccggcgaggt gtacgcccgc ggcccgcagg tggcccgcgg 48540 

ctacatcggc cgccccggcc tgaccgcgga gcggttcgtc gcctcgccct tcgcgcccgg 48600 

cgagcggatg taccgcaccg gcgacgtggc ccgctggacc gccgacggcc gcctggtgtt 48660 

cgccggtcgc tcggacgacc agatcaagat ccgcggtttc cggatcgagc ccggcgaggt 48720 

ggaggccgtg ctggccgccg gtccgggcgt gagccaggcc gcggtgatcg tgcgtgagga 48780 

cgtgccgggt gacaagcgct tggtggcgta cgtcgtgggc ggcgacgcgg agaccctccg 48840 

gtcgcatgcc cagcagcggt tgccgggtta tctggtgccg tcggcgttcg tggagctgga 48900 

ccggttgccg ttgacggtca acggcaagct cgatcgccgg gctctgccgg tgccggacta 48960 

cggccgggac gccggtggtg ggcgggcgcc ggccaacgct cgcgaggagg tgttgtgccg 4902 0 

ggcgttcgcc gaggtgctcg gcgtcgagcg ggtgggtgtg gaggacgatt tcttcgcgct 49080 
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gggtggtcat tcgctgctgg tggtctcgct ggtggagcgg ctgcgccggc aggggatctc 49140 

ggtgccggtg cgggcgttgt tcaccacgcc gaccccggcc gggctggccg aggcggtcgg 49200 

tgatggtgcc gtggtggtgc cgccgaacct gattcccgag gacgcggcgg agctgacccc 49260 

ggagatgctg ccgctggccg atctgaccgc cgacgaactt gctgttgtcg tggcgtcggt 49320 

gcccggtggt gcggcgaaca tcgcggatgt gtatccgttg gcgccgttgc aggagggcat 49380 

tttcttccat cacatgatgg ccgaccggga ttcggcggac gtgtatgtga cgccgacggt 49440 

ggtcgagttc gactcccggg accggttgga cggcttcctg gccgccttgc agcaggtcgt 49500 

cgaccgcacc gatgtctacc gcaccagcgt ggtgtggcag gggctgcgcg agccggtgca 49560 

ggtggtgtgg cggcacgcgc gcctgccgat cgacgaggtc gagctgcacg agggcaccga 49620 

tccggccgag cagctgatcg cgctcgccac cgagcgggtg gacctcgacc gcgcgccgct 49680 

gatccgcacg acgaccgcgg ccgtgcccgg atccggccgg tggctcgcgc ttctgcgcat 49740 

ccaccacctc gtgcaggacc acaccaccct ggacgtgctg ctcggcgagc tgcgggcctt 49800 

cctcgagggc cgcggcgacg agcttcccga gccggtgccg ttccgcgagt tcgtggcgca 49860 

ggcgcggctc ggtgtgccgc gcgaggagca cgagcggtac ttcgcggagt tgctcggcga 49920 

cgtcaccgag accaccgcgc cgtacggcct gaccgaggtg cacggcgacg gttcggccgc 49980 

cgtgcacagc cggcgcgagg tggacgacga cctcgccgcg cgcctccacc ggctggcccg 50040 

gtcgctcggc gtcagcccgg cggcgctctt ccacctcgcg tgggcgcggg tgctcggcgc 50100 

cgtgtcgggc cgggacgacg tcgtcttcgg cacggtcctg ttcgggcgga tgaactccgg 50160 

cgccgccgcc gaccgcgtgc agggcttgtt catcaacacg ctcccggtgc gcgtgcggct 50220 

cgccgccggc agcacccgcg acgccctgac cgggctgcgg gaccagctgg ccgggctgct 50280 

ggtgcacgag cacgcgccgc tcgcgctggc gcagcgcgcg gccggcatca ccgacggcag 50340 

cccgctgttc gcgtcgatct tcaactaccg ccacaaccag gacgacccgg cggcgtcggc 50400 

cgggctcgag ggcatccgca cggtctacag cgccgagcac accaactacc cgctcgacgc 50460 

ctcgatcgac gtcaccggcg accgcttcgc catcaccgtg aacgcggtgg cgcccgcgga 50520 

cgccgcgcgg atcgctgagc tgatgcacac ctgcctcggc cacctcgcgg acgtgctcga 50580 

agacgcgccg gagacgccgc tgtcgtgggt cagcccgctg agcgcggagg atctcggccg 50640 

catcgtgggc gactggaacg agacgcggcg cgcggtcacc cgcgcgtccg tgccggagct 50700 

gttcgccaag caggtggccg ccacgccgga cgcgatcgcg gtggcgggcg agggtgtgtc 50760 

gtggtcctat cgcgagctcg atgtgcgctc ggatgcgctg gcccggagtc tggtggcggc 50820 

cggtgtgggt atcgagtcgc cggtggtggt ggcgctcgat cggtctccgg aggtgccgac 50880 
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ggcgttcctc gcggtggcga aggccggcgg tgtgttcgtc ccggtggact tgtcgtggcc 50940 

ccaggcgcgt gtcgatgcgg tgatcgccga ctgcgccgcg cgggtggcgg tggccgaccg 51000 

gccgatgacc gggctgacgg ttgtgcccgc cgacgcggcc ggcgacccgg ctgccgagtt 51060 

gccgccccgc cccttgccgg gtgcggaggt ctaccggatg tacacctccg gctcgacggg 51120 

ccggcccaag ggtgtggtga ccacccacca gaacctggtg gatctggcga ccgacacgtg 51180 

ttggggtccg accccgcggg tgcttttcca cgccccgcac gccttcgacg cgtcgtcgta 51240 

cgagatctgg gtgccgttgc tgaatggcgg cacggtcgtg gtcgcgccgg ggcggagcat 51300 

cgatgccgcc gtgctcggcg agctgatccg ggcgcatgag ttgacgcacg tgcacgtcac 51360 

cgcgggcctg ctgcgggtgc tggacccgtc gtgcttcgcg gggctgaccg aggtgctcac 5142 0 

gggcggcgat gcggtgtcgg ccgaggcggt gcgccgggtg atggaggcga acccgggcct 51480 

gcgggtacgt cagctgtacg gcccgaccga ggtgacgctg tgcgccacgc agcaggtgct 51540 

cgatggcacg ggcgtgccga tcgggcggcc gttggacaac acccgcgtgt atgtcctgga 51600 

tgacttgctg cagccggtcc cggtcggtgt gaccggcgag ctgtatgtgg ccggtgccgg 51660 

cttggcgcgc ggctatgcgg gcatgcccgg gttgacggcc gagcggttcg tcgccgaccc 51720 

gttcagcagc ggcggtcgcc tctaccgcac gggtgatctg gtgcggtgga ccgatgacgg 51780 

tgtgctggtg ttcgcgggcc gggccgatga tcaggtgaag attcgtggct atcgggtgga 51840 

gccgggcgag gtcgaggctg tcttggccgc gcatccggac gtggctcagg tggcagtggt 51900 

cgtccgggag gacaccccag gggataagcg gctggtcgcc tacgtcgtcg gcggcgatgt 51960 

cgaggcgtat gcgcaggagc gccttccggg ctacctggtc ccgtcggcct tcgtccatct 52020 

ggacgcgctg ccgctgacca gcaaccagaa ggtcgaccgg gccgcactgc ccgcgccgtc 52080 

cgtggagtcc ggcgcgggcc gggcgcccgc cgacgcgcgt gaagagctga tgtgtgccgc 52140 

gtttgccgag gtgctcgacc tggatcgggt cggtgtcgac gacgacttct tcgctttggg 52200 

tgggcattcg ctgttggtgg tgcggttggt gggccgtatt cggcaggtgt tcggggtgga 52260 

ggtgtcggct cggctggtct tcgatgcgcg gactccggcc ggtgtggtgg cccgcttgtc 52320 

cgagggcggc acggcccggg aggcggtgcg ggcgcgggtg cgtcccgcgc gggtgccgtt 52380 

gtcgttcgcg caacgccggt tgtggttcct gtcccagctg gacggcacga gcacgaccta 52440 

caacatcccg gtcgcgctgc aactcgacgg cccgctcgat cgggacgcct tcaccgcggc 52500 

actgcacgat gtggtcgccc ggcacgaggt gctgcgtacc gtcttcaccg tcgccgatgg 52560 

cgagccgtgg caacacatcc tcgacacgcc gtcggtgagc gtccccgtca tcgaggtgcc 52620 

cgccgacggg cttccggagg cggtggccgc ggcggccgcg cacaccttcg acctgagccg 52 680 
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ggagatcccg ctccgggcgg tgctgctcgc caccggcgcc gaccggcacg tgctggtgct 52740 

ggtcgtgcat cacatcgccg ccgacggctg gtcgatgcag cccctcgccc gggacctcgc 52800 

cgtcgcctac gccgcccgga tccggggcga ggcgccggcc tggaccgccc tgcccgtcca 52860 

gtacgccgac tacgccctgt ggcagcgcga cgtgctcggc tccgagcacg acccggacag 52920 

cgccatctcc cagcaggtcg cccattggcg gcgacagctc gccggagccc ccgacgagct 52980 

accgctgccc gccgaccacc cccgtcccgc cgaggccacc taccgcggcc acaccgtgga 53040 

gttcaccgtg cccccggccg tgcaccacca actcgccgaa ctcgcccgcc gcaacggcgt 53100 

caccgtcttc atgaccgtgc aaaccgccct cgccgtcctc ctgtccaaac tcggcgccgg 53160 

caccgacatc cccatcggcg tcgccgtcgc cggacgcacc gaccccaccc tcgacaacct 53220 

catcggcttc ttcgtcaaca ccctcgtcct acgcaccgac ctgaccggca accccaccat 53280 

caccgacctg ctgcaccgca cccgcgacac caccctgcac gccttcaccc accaagacgt 53340 

ccccttcgaa aaactcgtcg aagacctcgc acccacccgc tccctcgccc gccaccccct 53400 

cttccaggtc atgatgaccc tgcagagcac cgggcgggcc ggcgaggcgg ccgagctgcc 53460 

cggcctggag acggcggtgc tgtcgccggg cggcgtcgcc gccaaggtcg acctcgacct 53520 

gagcctgagc gaggcgtacg acgacgacgg ccgcccggcg ggtctcgccg gaacgctcgt 53580 

cgcggcggcc gacctgttcg agcacggcac cgccgagcgg atcgccggtt acctcgcgcg 53640 

gctgctcgcc gtgctgcccg ccgatcccgg cgcccggctc ggcgacgtgg acctgctcga 53700 

cggcgaggag cggcggctgg tcctcaccgg ctggaacgac acgacggcgg ccgtgccggc 53760 

ggtggcggtg cccgagctga tcgagcggcg tgccgccgcc gaaccggagg ccggcgccgt 53820 

ctggtgcggc gacacgcacc tgcggtacgg cgagctgaac gcccgcgcga accgcctcgc 53880 

ccggctgctc gtggagcgcg gggcgggacc cgagtcgatc gtcgcggtct gcctggaacg 53940 

ctcggccgac ctcgtcgtca cgctgctggc cgtgctgaag accggggccg cctacctgcc 54000 

gatcgatccc ggatatccgg ccggccggat cgcctacatg ctcgccgacg cccgccccgc 54060 

gctgctcgtc acgagcccgg cggtcgcctc cggtgacagc ctcccggacg gtggcgcgca 54120 

acggatcgtc ctcggcgatc cggacaccgc ggcggccctc gacggcctcg ccggcaccga 54180 

cctgctcgtc tcggagcggc gcggcgtcac gcacccggca catccggcgt acgtcatcta 54240 

cacctccggg tcgaccgggc gccccaaggg tgtcgtcgtg ccgcacgggg ccctcacgaa 54300 

tttcgtggcg gcgatgagcg accggctcgc gctgggcgcc ggcgaccggc tgctcgcggt 54360 

caccacggtc gccttcgaca tccacgtcct ggagctctac gtgccgctgg tcggcggcgc 54420 

cggagtggtc gtcgccgagg acgccgtggt gcgcgacccg gccgcggtcg ccgcgctcct 54480 
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cgaccggcac 
cgggcacgcc 
cgcgctcgcc 
cgaggtcacc 
gatcggcacg 
gccgccgggc 
cggccgggcc 
gatgtaccgc 
ccgcgtcgac 
ggcgcttgcc 
gggcgaccgc 
ggcggcggtc 
cgtggtcctg 
ggcacccgac 
catctgcgcg 
cttcgccctg 
cggcatcgcg 
cgcggcccgc 
cgccaccgcg 
ccgggtggtc 
gccgctgcag 
gctgccggcc 
gcaacacgtg 
cgagccggtg 
cgcgggcgcc 
ccgggccccg 
gctgctccag 
gatccaggcc 
ctacgtcgcc 
gctgctcggt 



gccgtgacga 
gacgccgtcc 
ggccggatgg 
gtgtgggcga. 
cccctgcgca 
gtgccgggcg 
ggcctgaccg 
accggcgacc 
gaccaggtca 
gcgttcgggc 
cggctcgtcg 
cgcgcgcacg 
cccgacctgc 
tacggcgccg 
gtgttcgccg 
ggcggccatt 
gttccggtcc 
gccgagggcc 
ctcacgccgg 
gccgccgtgc 

gaggggctgc 

gtcctcggat 
atcgaccggc 
caggtggtct 
gacccggtgc 
ctgctgcggg 
atccaccacc 
ttcctcgccg 
cgggcccggc 
gacgtcaccg 



tcgtgcaggc 
gcgacgtgcg 
ccgcggccgg 
ccgtcgccga 
acacccgcgc 
agctctacct 
ccgagcgctt 
gggtccggtg 
agatccgcgg 
cggtggcccg 
gctatgtcgt 
tcgccgccca 
cgctgaccgc 
cctccgccgg 
agacgctcgg 
cgctgctggc 
gcgccctgtt 
cggacgagcc 
cgatgctcac 
ccggcggggc 
tcttccacag 
tcgattcgcg 
acgacacgta 
ggcgccgggc 
aggaactgct 
tgcgctgcgc 
tcgtccagga 
gccgcggcgg 
tcgccggcgc 
agagcaccgc 



caccccggcg 
gctgctcgtc 
tcgcggtgtc 
cctcggcgcg 
cttcgttctc 
cgccggcgat 
cgtggccgac 
gacccgcggc 
tttccggatc 
ggcggccgcc 
gccggccgcc 
gctgcccgcc 
gaacggcaag 
ccgggcaccg 
cgtgaccgac 
cgtgtcgctg 
ccagtcgggc 
ggccgtgccg 
cctcgtcgac 
cgcgaacgtg 
cctgatggac 
gtcccgcctc 
ccggaccgcg 
cacgctgccg 
cgccaccgcg 
ggcccggccg 
ccacgccacg 
cgagctcgcc 
gccggccgag 
cccgtacggg 
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ctgtggcagg 
ggcggcgagg 
accaacctgt 
agcccggccg 
gacgacgcgc 
cagctcgcgc 
ccgttcggcc 
ggcagcctgg 
gagctgggcg 
gccgtccgcg 
ggcgagccgg 
tacatggtcc 
ctcgaccgca 
gccgacgagc 
gtcgcagccg 
gtcgaacggt 
acccccgagg 
gccaacggca 
ctcgacgccg 
gccgacgtct 
ggcggcgacg 
gacgcgttcc 
gtggtgcacg 
gtcgaggagg 
ccggtcgagt 
gacggcggcg 
ctcgacgcga 
gcgcccgagc 
caccgggcgt 
ctgaccgacg 



cgctgctcgc 
cgctgccgcc 
acggcccgac 
ggccggtgcc 
tgcgcccggt 

ggggctacca 

gcggtgagcg 
agttcctggg 
aggtcgaggc 
aggacgtgcc 
agcccgaccc 
cgtcggcggt 
aggcgctgcc 
gcgaggcgct 
atgccgactt 
tgcgcgagca 
gcctggccgc 
tcccggacgg 
aggagatcgc 
atccgctcgc 
acgtgtacgt 
tggccgcgct 
acggcctgcg 
tgaccctgac 
tcgcgctcga 
gatggctggc 
tgctcgccga 
cgttccgcgg 
acttctcccg 
cgcgggacgc 



54540 

54600 

54660 

54720 

54780 

54840 

54900 

54960 

55020 

55080' 

55140 

55200 

55260 

55320 

55380 

55440 

55500 

55560 

55620 

55680 

55740 

55800 

55860 

55920 

55980 

56040 

56100 

56160 

56220 

56280 



gcggccgacc 
cacggcgagc 
gctgggcacg 
cggcgccggc 
ggtgcgcctc 
cgagctgatc 
cggcgggacg 
cgacaccggc 
ctacccggtg 
ggcccagccg 
caccgacgcg 
ggcggagcgc 
gacgctgccc 
cgccgagaac 
ctcgctgatc 
gccgggcctg 
cgatcccggc 
gctgctggcc 
ggacgacccc 
ggagcgcacc 
gacgggacgc 
ctccacctgc 
cgacgcggcc 
accggcggag 
caagctgttc 
cgccggcctg 
gcttctcgac 
caccatcacc 
gatcggcgtg 
gcccgtgggg 



ggaaaggccc 
gagctgggcg 
cttgccggcc 
gcccggtccg 
gccgccgccg 
ggtcacgagc 
ccgctgttca 
gacgaggaga 
gccgtctccg 
gccgcaccgg 
ctcgcgggca 
cgggaggttc 
gcgatcgtcg 
gaccggctca 
gcccgcgggg 
gtggtggcga 
tatccggcgc 
acggccggca 
ggcaccgcgg 
cacccgctgc 
ccgaagggcg 
cttcagctgg 
accttcgaga 
acgctctcgg 
ctcaccaccg 
aaggcggtct 
cgggtgcccg 
acccttcatc 
cccatctccg 
gtcgccggtg 



atcgcgaggt 
tgagcccggc 
gcgacgacgt 
ggcgagccct 
gcagccgcga 
acgccccgct 
cctcgatcct 
tcgagggcat 
tcgacgacga 
accgcgtcgc 
cgcccgacgt 
tccggacgcc 
gcgagtgggc 
cgtacgccga 
tcggtcccgg 
tgctggcgat 
cccggctggc 
ccgcggacgc 
cggagctggc 
gccccgggca 
tgctggtgcc 
caccggacga 
tctggggcgc 
tggccgagct 
gcctgctgca 
acaccggcgg 
gcctcgagct 
gcgtacgccc 
acacccgggt 
agctctacac 



cgaccggcgg 
gaccgtgttc 
cgtcttcggc 
cggcccgttc 
gacgctggcc 
gacgctggca 
caactaccgg 
cgagctgctc 
cggttcgggc 
cgagctgctg 
ggagccggcg 
gaacgccacg 
gcggaccacg 
gctggacgct 
tgccgtcgtc 
cgtcaaggcc 
ccggatggtc 
cgtgcccgcc 
ccggctggac 
cccggcgtac 
gcacgccggc 
cgtcctgccg 
gctgctcaac 
gcgggccttc 
cgaggtgatc 
tgacgtgctc 
ctacaacgcc 
ggaggacctc 
gtacgtgctc 
ctccggcatc 
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ctggccgccc 
catctcgcct 
accgtcctgc 
atcaacaccc 
gggctgcgcg 
caggccgcga 
caggggccgc 
tccaccgagg 
ttccggctca 
cacacctgcc 
cggatcgacg 
gcccgcgacg 
cccggcgcga 
cgcgccaacc 
ggcatgctcc 
ggcggcgcct 
gaggacgccg 
gggccgcagc 
ggcgacccga 
ctgatgttca 
atcgaccgca 
cacctctcgt 
ggcgccaccc 
ctcgcggacc 
gacgccgacg 
tccccggcgc 
tacggcccga 
gacgcgggca 
gacgacgcgc 
gggctggcgc 



gcgtgcgggc 
gggcgcgggt 
tgggacggct 
tgccggtgcg 
cccagctggc 
gcggcgtgcc 
ccgccggcga 
aacgcagcaa 
ccgtcgacgc 
tgcaccggct 
tgctcggcga 
tggcggcggc 
ccgcggtcac 
gcctggcccg 
tgccccgctc 
acctgccgct 
cgccggcgct 
gactgctgct 
tccgcgacga 
cctccggatc 
tggtccgccg 
cggtgtcgtt 
tcgccgtcgc 

ggggcgccac 
tgaccgccct 
actgccggtc 
ccgagaacac 
cgggcgtacc 
tgcggccggt 
acggctacgc 



56340 
56400 
56460 
56520 
56580 
56640 
56700 
56760 
56820 
56880 
56940 
57000 
57060 
57120 
57180 
57240 
57300 
57360 
57420 
57480 
57540 
57600 
57660 
57720 
57780 
57840 
57900 
57960 
58020 
58080 



cggacgaccg gcgccgaccg cggagcgctt cgtggcgtgc ccgttcgcac ccggtgagcg 58140 

gatgtaccgc accggcgacc tggtgcgctg gaccgccgac gggcgcctgc tgttcgccgg 58200 

ccgcgccgac aaccaggtca agatccgggg cttccgggtg gagccgggcg agctcgagac 58260 

ggtcctgtcc ggacatccgg ccgtggcacg ggccgcggtg ctggcgcgcg aggacacgcc 58320 

cggcgccaag cggctggtcg catacgtcgt gccggcccgg ccggacgagg acggggacgc 58380 

gctggccgag tccgtgcgcg cctacgccgc ccggcaggtg cccgactatc tgatgcccgc 58440 

cgcgacggtg gtgctcccgg acctgccgtt gaccagcagc ggcaaggtcg accgggccgc 58500 

gctgccggcg ccggacgtgc cgggcgggcc gggccgcgcc gccggcacgc tcaccgagga 58560 

gatcctctgc ggcgtcttcg cccaggtgct cgggctgccc acggtcggcg tcgacgacga 58620 

cttcttcgcc agcggcggcc attcgctgct ggccacccgg ctggtcagcc ggctgcgtgc 58680 

cgtcttcggg gccgagctgc cgatccgggc cgtcttcgag gcgccgacgc cggccacgct 58740 

ggccacccgg ctcggcgcat ccgcgccgcg gcgactcgcg ctcggcgaac gcgcccggcc 58800 

ggagaacgtg cccctgtcgt acgcccagcg gcggctgtgg ttcctcgacc gcctggaggg 58860 

acaggacggc acctacacca tcccgctcac cgtgcggctc gacgggccgg tcgaccgggc 58920 

ggcgctcgcc gcggccctgc gcgacgtcct ggagcgccac gaggtgctgc ggaccgtctt 58980 

cccgctcgtg gacggcgaac ccgtccagcg ggtgctgccg gtgcacgaca ccggcttcac 59040 

gctcggcggc ggtgacgtcg cggccgccga cctcggcgcc gcggtcgccg aggccacggc 59100 

cggcaccttc gatctggccg ccgagatccc ggtgcgcgcc tggctgttcc gcgccgggcc 59160 

cgaggaccac accctcgtgc tgctggtgca ccacgtcgcc ggcgacggct ggtcgatgac 59220 

gccgctggcc cgcgacatcg ccaccgccta cgacagccgc cgcgagagcc gggcgccgca 59280 

atgggagccg ctgcccgtgc agtacgccga ctacgcgctc tggcagcgcg aactgctcgg 59340 

cgccgaggac gatccggaga gtttgctgtc gcggcagctg gcctactggc gggacgcgct 59400 

cgacggcgta ccggaggagc tggacctccc ggccgaccgg ccgcgcccgg ccgaggccac 59460 

gcaccgggga cacgaggtgc ccgtgcgggt gccggccgag gtgcaccggc gcctggccga 59520 

gctggcccgg tccgagggcg tgaccgtgtt catggtgctg caggccgcct tcggcacgct 59580 

gctgtcccgc ctcggcgccg gcgccgacat cccgatcggc acggcggtcg ccggccgcac 59640 

cgaccaggcc ctcgacgagc tcgtcgggtt cttcgtcaac acgctggtga tccgggccga 59700 

cctgtccggc gaccccacct tccgggagct gctcggccgg gtgcgcgcca ccggcctgtc 59760 

cgcctacgag caccaggacg tcccgttcga gcggctcgtc gaggtgctgg caccggcccg 59820 

atcgctcgcc cggcacccgc tcttccaggt catgctcacg ctgcagaaca ccggccgcgc 59880 
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ggacgccggc gaccaggccg tcccgccggc cgccggatcg gccgcggcca agttcgacct 59940 

cgagatcagc atcgcggaga cgttcgccgc cgatggcgag ccggccgggc tcagcggcgt 60000 

tctcatcgcc gccgccgacc tgttcgagcc ggccaccgcg gccgcgttcg ccgaacggct 60060 

ggcccgcgtg ctggccgcgg ccggcgccga tccgcggctg cgggtcagcc aggtcgacat 60120 

cctcagcgcc gaggagcgcg aggccgtcct gtccggcggc aacggcggca ccgcgccggt 60180 

tcccgtcacc accgtcccgg cg'ctcttcgc cgagcaggcc cgccggaccc cgggcgcggt 60240 

ggcggcgctg agcgagggga tgtcgctcac ctacgccgat ctcgccgccc gcgtgaaccg 60300 

gctcgcccgg cacctggtga gcctcggcgc cggaccggag accgtcgtcg gtatcgccat 60360 

gagccgcggc ctcgacatgc tggtggcggt cctcgcggtc gggcaggccg gcgccgccta 60420 

cctgcccgtc gacccgtcct acccggacga gcgcaaggag ttcatgctca ccgacgccgg 60480 

cgccgcgtat gtgctcacct tggcctcgga cgccgaccgc gtgccgccgg gaaccccggc 60540 

cgccgccgtg gtcctggacg agcccgtgac ggccgcgcgg atcgccgggc tcgatccggc 60600 

cgacctgacc gacgccgacc gggtggcgcc gctgctgccg gcccaccggg cgtacgtcat 60660 

ctacacctcc ggatccaccg gccggcccaa gggtgtcgcc gtcgagcacc gcaccgtggt 60720 

caacctgctg tcctgggcgg ccgggcggtt cggcggcgcc gacttcgccc ggacgctcgc 60780 

cgccacctcg ctcaacttcg acgtctcggt cttcgagatc ttcgggccgc tggtgtccgg 60840 

cggcagcatc gagatcgtca ccgacctgct cgccctggcc gacccggcct ccccggcctg 60900 

ggaggccagc ctggtcagcg gcgtgccgtc ggcgttctcg cgggtcctcg accggggcga 60960 

catcgccgcg cgcacccgca gcgtggtgct ggccggcgag gcgctgaccg ccgacgtggt 61020 

gaacgccacc cgtgccgccc tgcccggtgt ccgggtggcc aacatctacg ggccgaccga 61080 

ggcgaccgtc tactcgaccg cctggcacac cgaccgggac gtgaccggcg gcgccgcgcc 61140 

gatcgggcgg ccggtcacca acacccgcgc ctacgtcctc gacgaccgtc tcacgccggt 61200 

gccgccgggc gtggtgggcg agctctacct ggccggcgcc cagctggccc gcggctatct 61260 

gggccggccc ggcctgaccg gcgagcgctt cgtggcctgc ccgttcggcc cgggcgggga 61320 

gcgcatgtac cgcaccggcg accgggtccg gtggaacgcc gacggcgacc tggtcttcgc 61380 

cggccgggcc gacgaccagg tcaagatccg cggcttccgt atcgagccgg gcgaggtgca 61440 

ggccgtcgtg gcgcgccagg ccggcgtggc ccgggcggtg gtgctggccc ggagcgactc 61500 

gcccggcgac gcccgcctgg tcgcgtacgt cgtgccggcc gaccgggacg ccgaccgccg 61560 

ggcgctggcc gccaccgtcc gctcggacac cgcgcgcgag ctgccggcgt acctggtgcc 61620 

ggcggccgtg gtggtcctcg acgagctgcc cgtcacggcc aacggcaagc tcgatcgccg 61680 
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tgcgctgccc gcgcccggcc tggccgaggc gggcagcggc cgcgggccgg tcacccaccg 61740 

cgaggaggtg ctctgcgagg tcttcgccca ggtgctcggc ctgccctcgg tcggcgtgga 61800 

cgacgacttc ttcgcgctcg gcgggcactc cctgctggcc gtctcgctgg tggagcagct 61860 

gcgccgccgc ggcgtgacgg tcggggtgcg cgcgctcttc cagacgccca cggtcgccgg 61920 

cctggccgag gcggccgcgc ccaccacggt cgccgttccg cccaacctca tccccgagga 61980 

cgcgcggcac atcacgcccg gcctgctgcc gctcgtggag ctggagcagg ccgagatcga 62040 

ccaggtcgtg gccactgtgg acggcggcgc cgccaacgtg gccgacatct atccgctcgc 62100 

gccgctccag cagggcatgc tcttccacca cctcatggcc ggcgacgacg gcgaggacgt 62160 

ctacatcatg cccgcggtcg tggagttcga ctcggcggac cgcttcggcg ccttcgtcga 62220 

cgccctccag cacgtgatcg accgcaacga cgtctaccgc accggcgtgg tctgggacgg 62280 

cctgcgcgag ccggtgcagg tggtctggcg ccgggcgccc ctgcccgtga ccgaggtgac 62340 

gctcgatccg gccggcggcg atcccgccgc ccagctgcac gccgccgccg gcgcccggat 62400 

ggacctgaac cgggcgcccc tgctcgacct ccacgtggcc gcccggcccg aggacggcca 62460 

acggctggcc ctgctgcggg ttcaccacat ggtgcaggac cacatggggc tcgaggtgct 62520 

cctcggcgag gtgcaggcgt tcctggccgg ccgcggcgac gagcttcccg atccgctgcc 62580 

gttccgcgac ttcgtggcgc agacccgcgg cggggtgccg gaggccgagc accggcggtt 62640 

cttcgccggg ctgctgggcg acgtcaccga gcccaccgcg ccgtacggcc tgctcgacgt 62700 

gcaccgcgac ggcgtcggcc tggtgcgcca ggaacgcccg ctcgacggtg aggtggtggc 62760 

ccggctccgc gccgtggccc gccggctcgg ggtgagcccg gcgaccgtca tgcacgtcgc 62820 

ctgggcgcgc gtgctcggcg tgatctccgg ccgcgacgac gtggtcttcg gcacgctgct 62880 

gctgggccgg ttcagcaccg gcgccgaccg ggtgcccggc ccgttcatca acacgcttcc 62940 

ggtgcgggcc cggctcggcg gcacgggcgc cgcggcggcg gtggcggaga tgcgccggct 63000 

gctggccgag ctgctcgagc acgagcacgc gccgctgacc acggcgcagc aggccagcgg 63060 

actctccgga aacctcccgc tgttcacggc gctgttcaac tatcggcaca acacgtcgcc 63120 

gggtgcggac ccgtcgcccg cggccggccc gaccgagggc atccgcccgg tctccatgcg 63180 

ggagcgcacc aactatccga tctcggtggc ggtggacgac gacggcgagg gcctcggcgt 63240 

ggcggtcaac gcgatcccgc cggtgcggcc ggaggcggtg tgcgagctcg tggcgaccgc 63300 

gaccgagagc ctgacctcgg cgctggagct gttcctcgac ggcggtccgg acaccgcggt 63360 

cggcgagctc gacgtgctgc cgccggggga gcggtcgcgg ctgctggtgg agtggaacga 63420 

cacggctcgt ccggtggtgg agtcgtcggt gccggcgttg ttcgccgagc gggtggcggc 63480 
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ccggatgcgc gccgtgcagc ccgaggggcc gtaccacgtg ctcggcttct ccttcggcgt 65340 

ggcgcccgcg cacgagatcg ccgtccagct gcgcgagcag ggcgccgagg tcgtgctggt 65400 

gctcatggac tcctatccca tggaggatgc ggagtccggc gagcaggcgg ccgacgagga 65460 

ggagctgccg tgggaggagc tcatcgaggc cgagttcggc cgggtgctcg gcggcttctc 65520 

ccgcgacgaa ctggcggcct tcgccgccgt cttccgcaac aacaccaaga ttcgcgcacg 65580 

ccaccggctg ggccgcttcg acggggacgc cctgctgatc gcctcgaccg acagcgcacc 65640 

cgacggcgag tccaacacct ggcggtgggc gccgtacatc accggtgaga tcacccaggt 65700 

ggtgctcccc tgcgagcaca ccgacctggt acgccccgac atgctcgcgc tgctctggcc 65760 

ggccgtcgag gcgtggcagg ccgggcggca ccgaccttag tcaccacagc gtcgagagga 65820 

ccgacatgca gaagatcccg ctcgtgtgtg tgccgttcgc cggtgccggc gcctcgttct 65880 

tccacccgtg ggccgagctc gccgggccgg accggccgat cgtcgcgctc cagcttccgg 65940 

gccgggagtg gcggctgctc gacgaaccgt acgcggacgt cgtcgcggcg gccgcggacc 66000 

tggcgctcac cgtcgccgac gaggtgggcg cggggggccg ggtggcgctc ttcgggcaca 66060 

gcctcggcgc cgtcctcgcg tacgagatag cgcacgcgct ggtgcgcgac ggcgaggtgg 66120 

gcgtggagcg gctcttcgtc agcggctcgc ccgatccctg gacccctcgc accaaccggg 66180 

cgagcggcct ggacgacgag gagttcctgc tgcgggtgcg cgagttcgcc ggttacgacc 66240 

acgaggcgct cgccgatccg gacatgcgcg agctgatcct gcccgcgctg cgcgccgacg 66300 

tcgagatgca cgagagctac gtggcgggca gcgccgatcc gctgcccgca cccgtcaccg 66360 

cgctgcacgc ccgcgacgac gcgctggtct ccgccgagca gacggccggg tggagcaagg 66420 

ccaccagcgg cccgttccag ctggtcgagg tggacggcgg ccacatgtac ctcaccgagg 66480 

acccggccgg cctgctgcgc ctgatcgccg ccgacctgga ccgtgactga acccgaggag 66540 

aacccgtgcg cttgaccggc aagaccgcca tcgtcaccgg cgcggcccgc ggcctcggcc 66600 

gcgcctgcgc cgtggccttc gccgccgagg gagccgacct ggtgctcctg gaccgcgcgg 66660 

ccgacctgcc gggggtgccc tatccgctgg gcaccgtggg ccagctggag cacaccgccg 66720 

acctctgccg caagcagggc gccgcggtgc tcaccgtccg ggcggacgtg cgcgacctcg 66780 

cggcgctcac cgcggcggcc gatcgggcga tcgaccgctt cggcggcatc gacgtgctcg 66840 

tcaacaacgc gggcatcgcc gcgccgtccg gaaaggtcac ccacgagatc accgaggacg 66900 

agtggcagct gatgatcgac gtcgacctct ccggcgcgtg gcgcatgacg gcggcggtcg 66960 

gccggcacat gaccgagcgc cgctcgggca gcatcgtcaa catcgcctcg acggccggtc 67020 

aggtcggcta ccggcacttc gccggctacg tcgccgccaa gcacggcatc gtcgggctca 67080 
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cccgggccgc cgcgctcgac tacgcgccag cgaaggtgcg ggtcaacgcc gtctgcccgg 67140 

gttcggtgcg cgacgatccg cagttcgagg gccggatgct gtcggagatc gcccggtcgc 67200 

tcgacgtgcc ggtcgccgag cacgagcaga ccttcctgca ggcgcagccc atgaacgccc 67260 

tcatcgagcc ggacgacgtc gccaacgccg cgatctggct cgcctccgac gaatcgcgcc 67320 

aggtcaccgg ctccgtcgtc acggtcgacg gcggattcac cacgcgctga acggggagaa 67380 

gacgtgccca agtcccagcc cgccaccaga accgcggcgc ccggcgccgc cgagtgccac 67440 

gcgcttgccg tgcgcctggc cggaccgatc gacccggcgc cgatcgagcg gcggctcgcc 67500 

gcccgcatgc cgttctggca cgagcacgtg gcggcccggc cgggcgatga ggccgcgctg 67560 

cgccgccgcg agcgcgaact cgcccgcccg gtgccgccgg agcccggtgc gcgggcggtg 67620 

ctgctcgcct acgcggacgg ctcggccgac ctggtgctgg tggcccgccg cgaccgcctg 67680 

gaccgcgacg cgctgatcgc cctggcccgc ccggagcggg cgccgcgcgg gcgcaaaccg 67740 

gcggaaccgg acgcgccgcc gccctcggcc gcgcccgcct ggggcctggg cgacggcggc 67800 

ccggacgacc ggtgggccga gctgcgcgtg ccggcgcgcg gcccggccga cccggcgcgc 67860 

tggcccgccg cgctcgccaa ggtcctcgcc cggtacgagc cgggcgcggc tgcgggctcg 67920 

ggcgcggcgg cgggcttggg cgcggcggcg ggctcgggtg tggcggcggg ctccagcgcc 67980 

gcctcgggct ccggcgccgc cgcggtcccc ggcccggtgg cgctggcctt cgacggcgac 68040 

ctcgccccgc cggacgagta cgtgcccttc ctggcgccca cccacccgct caccgtgcag 68100 

gtctcccgga cgcccggcgg cggcaccgag ctgcggtgcc gccaccggct cggcgcggtc 68160 

tcgccggccg ccgccgaagc gttcgcccgg atgctggccg cggcacacgg cgagccgccg 68220 

gccgatgacg gcgcgaccgc cgagcccacg ccgccggccg cacccgcacc cgcacccgca 68280 

cctgcgcccg cgccgccggc tgcggctcgc acgctgaccg ggctcttcgc cgagcaggtc 68340 

gccgctcgtc ccacggccgt cgccgtctcg gacgatcggg gtcggcacac ctaccgcgag 68400 

ctcgacgagt ggtccggccg gctggcccgg gggctgcgga aggccggcgt gcgcgacggc 68460 

gacgcggtcg gcgtctgcct cgaccgctcg gccgagctcg tcgccgtgct cctcgcggtc 68520 

ctcaaggccg gcgccgccta tgtgccgctc gacgcggcgt acccggccga ccgcatcgcc 68580 

tacaccgtcg gcgacgccgg cctcgcggtc gtggtcacca cctcggcgga ctttcccgac 68640 

gtggacggtg tgcggctgct cgcgccggag agcctcgccg aggccggcga cgacccgggc 68700 

atcccgctcg ccaccccggc cggcccggag cggccggcct atgtcatcta cacgtccggt 68760 

tccacgggcc ggcccaaggg cgtggtcgtc ccgcacgcca acgtgtccgc gctgctcgac 68820 

gccacgcgcg aggagtacgc gctcggcccc ggcgacgtgt ggaccttctt ccactcggcc 68880 
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gccttcgact 
gtcgtcccgt 
ggcgtcaccg 
ggggcggagc 
cggacggtgc 
ttcggcatca 
ctcgccggct 
gagcagcgcc 
gtggcgatcg 
gacggccggc 
gaacacctgg 
gagatccggg 
cgctccactc 
ggcgccacgc 
gcgaccttca 
gccctgcccg 
gaggacgggc 
accgtctcgg 
gcgctgatgc 
accgtgcgcg 
aagtacggga 
ctgctgagcg 
gcgccgccct 
acccagacct 
gacaccaccg 
ctcgcctcct 
gtcgtcaccg 
gacatcagcg 
cagcagggca 
caggtgaccg 



tctccgtctg 
actgggtgtc 
tgctcaacca 
gcgacctcgc 
tgccctggct 
ccgagaccac 
cccggtcggt 
gcgaggtgcc 
gctacctcaa 
gctggtaccg 
gccggctcga 
gcgtgctcac 
cggacgatcc 
cgccggtggc 
cgttcctgga 
agcccgcgcg 
tcgcgggcga 
acaacttctt 
gccgccgcgg 
gcctcgcgga 
gaagcaccat 
tggcggcgtg 
cggccgtgac 
atcaccaggc 
agatcggcaa 
accagcagca 
ggctcacccc 
cctggccgct 
gcgggcccag 
agctggccat 



ggagatctgg 
ccgctcgccg 
gacgccgtcc 
cgtacgcctg 
ggaccgccgt 
cgtgcacgtc 
cggccgcccg 
gccgggcgtg 
ccgcccggag 
ctccggcgac 
cgaccaggtc 
cgagtgcgcc 
ggcgaccgcg 
cgagcacgcg 
cgcgctgccg 
cccggccgcc 
cctggccgac 
cgacctcggc 
cctgccccgg 
cgcgttgcgc 
gcgaaacctg 
cagctcgacc 
ggccaccggc 
gctcgacgcc 
gtacgcgtcc 
gggcgtgcac 
gccggccgac 
gacgaaggcc 
cgcgatcctg 
ccagggaccc 



ggctgcctgc 
gagcagttcc 
agcttcacgc 
gtgattttcg 
cccgaggcgc 
acggcggtcg 
ctgcccggct 
ccgggcgaga 
ctgaccgccg 
cgcggccggc 
aagctgcgcg 
ggggtggcgg 
cggctcgacg 
gcccggatgc 
atgacgccga 
gacgctgcgg 
gtgtggcagc 
ggcaactcgc 
ctgcatccgc 
tcggcctgac 
cgtcggacca 
cccgcggcga 
ccggcggccg 
ttcgtcgcgg 
ggccgggcgc 
accagcggcg 
cccaccggcg 
gacgggaccc 
gcgaacgtcg 
tgcgcggcgt 
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tcaccggcgg 
acgacctgct 
agctcgtggc 
gtggtgagcc 
gctgccggct 
acgtcacgcg 
gggccgtgcg 
tctacgtggg 
agcggttcgt 
tgctgcccga 
gcttccggat 
cggccgcggt 
cgtacgtggt 
tgccggccta 
acggcaaggt 
cgacgccggc 
aggtcttcgg 
tgctcgccgt 
gcaccctcta 
agtcccctcg 
ccggcatcgg 
gcgagccccc 
agaaggccgt 
cgagcaacaa 
tgatgacctt 
agccgcgcat 
tccagctgcg 
cggccgacaa 
cccgctcggg 
gaccgtccgg 



ccacctcgtc 
cgccgagcgc 
cgccgaccgc 
gctcgacgcc 
ggtcaacatg 
cgcggccgcg 
cgtgctcgac 
cggcgccggc 
caccggcccg 
cggcaccctg 
cgagctggac 
cgtcatccgg 
cgccgaggcc 
catgtgcccg 
ggacaaggcc 
cggcccgggt 
ctgcccggtg 
gcggatggcg 
cctgcacccc 
ccccctcgag 
actgctcgcg 
gccgtccgcg 
caagtcgggc 
gggcacgacc 
ccagggcatc 
cgacgagccg 
cggctgcatc 
ggtgggcggg 
tgccacctgg 
cgatggctac 



68940 
69000 
69060 
69120 
69180 
69240 
69300 
69360 
69420 
69480 
69540 
69600 
69660 
69720 
69780 
69840 
69900 
69960 
70020 
70080 
70140 
70200 
70260 
70320 
70380 
70440 
70500 
70560 
70620 
70680 



cggccgggct cacggtcctg gcgttcgccg ccggcttctg gcagaagctg ccctgccagg 70740 

ccgctggctg gccggacgac accgcgacgc tgttcggccg ctactgctac agcgacgtgc 70800 

cgattctctt ccgggagcgc ggccttttcg acggcatttt cccgtacgag tccgggccgg 70860 

gcgcccagcc gctggagtac ccggtcctca ccggctacct gatggacgcc acggcccggc 70920 

tcgttcgcgc gatcctgccc ggcgcggacg tggccgtcgc ctcccgggcg tacttcctca 70980 

cgacggtcct ggtgctgctc gccctcgcgg tcctgaccgt gtgggcgacc ggtgcggtgc 71040 

tgcgccgcac cggcgggcgg ccgggcgacg cgctgctggt cgccgccgca ccggtgctga 71100 

tcctggccgg cacggtgaac tgggacctgc tcgcggtcgc ggcggcggtg ctcgcgatcc 71160 

tcgcctggga acgggaccgc ccgctgctgg ccggcgtgct gatcgggctg ggcacggcgg 71220 

ccaagctgtt cccgctggtg ctgctcggcc cggtgctgct gctctgcctc cggcagcggc 71280 

ggatgcggcg gttcgcccgc gtggccgccg gtgccgccgg ggcctggctt ctggtcaacc 71340 

tgccggtggt cgcgctgcaa ccggacggct ggatggagtt ctggcggttc aacgccgggc 71400 

gcggggccga gttcgggtcg ctctggttcg cgctggacgg gctcggcctg cacatgccgg 71460 

cggtgaacgc cgtcgccctg gcgacgttcg gcgtgctgct ggccgggatc gcggtgctgg 71520 

ctctgcggtc gcgccggccg ccggacctgg cgcaactcgc ctgcctggcc gtcggcgcgt 71580 

tcctgctgac caacaaggtc tactcgccgc agtacgcgct ctggctcctg ccgctcgtgg 71640 

tgatcgcccg tgggcgggtc ccgcggtggc cggtggtgcg cgactgggcc gtctggcagg 71700 

ccgccgaggt gctctactgg ctcgcggtgt ggagctggct cgccggttcg ctgaccgacg 71760 

agcggcagta cgcctgggca accgtcctgc gcgtgctcgc cacggcgtac gtctgtggtc 71820 

aggtggtgtg ggacgtgctc gccgcccctc gcccgcaccg gccggcgccg cccccggcgg 71880 

tcgccgagcc ggcccacccg ggctgacccg ggcaacgaga gatcccctcg ccgccggcgg 71940 

cgaggggatc tctgttcgtg tgctcagtcg gcggcgaccc acgtgaggcc gttgccggaa 72000 

ggggtcagcc cgccctcgaa cagcggctgc tcctgcgtgc cctccgggcc caggatcgcg 72060 

ccggcctgga tctgcgagcc ttcgaacatg gcctgcgcca cgacgctcga cccgcgcagg 72120 

aagccggtgc cgccctcctc gaggccggcc accgcctggc ccagctccgc ggtctgccgg 72180 

gagtgccgcc gcaccagctc ggtggagccg gtcagcgcgt cctcgccgga ggcgatgccg 72240 

ccgaccaact cggtgaacga ctccatggcc ggtgcgctgc tctcggtcac cttgcgggcg 72300 

gcccagaagt acgagctctc atcgacgtgc atgtcgtaga agctcaccag gaagtcgtgg 72360 

aagacgccga actcgcgccg gtagcgctgc tcgaactcgg tgaaggcgcg gtcctcgtcc 72420 

accgtgccgg ccagcacgct gttgatggag cgggccgcga gcaggccgct gtaggtcgcc 72480 
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aggtgcacgc cggaggagaa caccgggtcg atgaagcagg cggcgtcgcc gaccaggcac 72540 

atgcccggcc gccagtacga ggtgctggag taggagtagt ccttgcggac ccggatctgg 72600 

ccgtagtcgc cctcggtgac ccgggtggcg tcaccgagga agtcggcgat catcgggcac 72660 

tcggcgatca gctcgaacag cgccttctcc tggtcgccct gcaccttgtg tgccatctcc 72720 

cggcggacca ccgcgccgac gctggtcagc gtctcgctga gcgggatgta ccagaaccag 72780 

ccggagccga aggcgacgca gaggatgttg ccggagttcg gcgcgggcag ccgccggccg 72840 

ttctcgaagt agccgaagag cgcgaggttg cggaagaacg gcgagtattc gcgggtgccg 72900 

ccgaccgtcg tgtgcagggt gctgcggttg cccgaggcgt cgacgacgaa gcgcgccagc 72960 

gcggtgcggg tctgcccgtc ctgggtgaaa cggacgccgc ggacccgctc gtcgtcggcg 73020 

atcgcctcgg tcaccgggtg gttctcccgg acgtcgacgc ccagccggcg cgcgttctcc 73080 

agcaggatct ggtcgaaccg gcggcgctcc acctggaacg cgtgcgaggt cggcccggcc 73140 

atccggggcg aggtggcgaa ggtgaacgtc cacggctcgg tactggtgcc ccacttgaac 732 00 

gtgccgccgt gcttgcgcat gaacccggcc ttggcgatct cgtcgccgac cccgagcagg 732 60 

ttgcagacac cgtgcacggt ggacggcagc agcgattcgc cgatctggta ccgcgggaac 73320 

ttctcccgct ccagcagcag caccttggcg ccctgcttgg cggtcagcgc cgcggccgtg 73380 

gaaccgcccg gcccgccgcc gaccacgatg acatcgaact cttccggttg agcagccacg 73440 

aatcctcctc agcgaccatc tgagctgcat gatggttgcc gcgggccgct gtgcgccacg 73500 

tgtccgccca cccgcggcgt tcgaccgcgg atgtcgccgg tcgcacgctc gcgcgggccg 73560 

ggtcagctcc cctcgaccag ccggttctcc caggcccagg ccgccacgcc gacccggttg 73620 

cgtacgccca atttggactg gatcgtggag gcgtgcccct tcaccgtgct cagcgagatg 73680 

aagagatcgg ccgcgatctc ctggttcgtg cggccgcggg cgatcgcccg cgccacctcg 73740 

agctcgcggt cggagagggg aatgggctcg gagccggccg gcgccgccga ggcgttcagg 73800 

tggttcagca gccgcacggt gaccgacggc gagaccagcg cgtcgccctt gtgcgccgcc 73860 

cggacggcct ccaccagcag tgtcgggccg gcgtccttga ggatgaagcc gaccgccccg 73920 

ccgcgtagcg cgccgtagac gtattcgtcc gagtcgaacg tggtgaccac gatgacccgc 73980 

agaggattga ccacgccggg gccggccagc gagcgggtca cctcgatgcc gtcgatgcgg 74040 

ggcatgcgga tgtccaccag gcacacgtcc ggccgcagct tgcgcgcctg tgcgaccgcg 74100 

tccacgccgt cgacggcctc ggccaccacc tcgatgtcgg gctcgtcctc gaggatcagg 74160 

cgcaggccac tgcggatcat cgcctgatcg tcggcgatca ggacacggat cgtcatcgtc 74220 

gctccctcgg tgggttgggc agggtggccg cgaccgacca gccgccgccg ggacgggggc 74280 
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ccgtgctcag 
cgccgcggtg 
tctcgtcggc 
gggcgacgtt 
gccactgctt 
gctcgacaag 
cgcgcagcag 
tgaccgccag 
gggtctgcag 
gttcgaggcg 
tccgcagcga 
ccggcgccgg 
gggcggcgcc 
tgaggacggc 
ccgccgtgac 
cgcgccgccg 
agtaggccag 
agaggacggc 
atccaagggt 
tcatgccgcc 
gcccgaaatc 
gccatgtctc 
ggccggcgga 
ctggtcggac 
gcgccgagcc 
ctggcgcggg 
gaccacctca 
cgcatcgcct 
cgggccgagc 
gagcccggcg 



cgtgccgccg 
gtgcaaccgc 
ctgctccacg 
ggtgagcgcc 
catgccgtcg 
cgtgctcagc 
gcccacgacc 
ccgctcgggc 
gatcatgccg 
ttcctcctgg 
caacccggtg 
gccggcgtcc 
gcccacggcg 
cagggccagc 
gacggcgccc 
gatcagggcg 
gccccagctc 
cagccccgcg 
acggatgccg 
caaggatagg 
aacccttctt 
acgagcgctc 
gggccctgac 
cgcgcggctc 
ggggccggat 
tgggatacgt 
cgatgggcgc 
cggccggcat 
tggcgttgac 
cggtgctgga 



aggctctcca 
ggtggggccg 
gtcacggtca 
tcgcggacga 
ggagtggtga 
tcctcgggct 
cggcgcatgg 
acgcgcccgg 
gtgatgtggt 
cggatgcgct 
gccgtgccgc 
caggtcgcgg 
aaggcgggga 
gccatggccg 
gccgcgatcg 
agcaggcaga 
tgcgcgacgg 
tcggccacca 
gcgccgtcgc 
ggcgaatcgg 
agggcggtac 
cactcccgtt 
cgactgcacg 
gggcaagtcc 
ccgggttctg 
accgcgggag 
gcggctcaat 
tccgcgtacc 
cctggccggc 
tgcgccggcc 



cccgctctcg 
ccgccgcgtc 
cggtcacgtt 
tccggtagac 
gccgcaccgg 
ccggtgccga 
ccgcgagggc 
cgtcgcgccg 
gcgcgaccac 
cggcgttcgc 
ccagccaggt 
cggccacctg 
tcgtgggtgc 
ggccgggctc 
tgagcccggc 
cgagcgtcgc 
cgacggcctc 
tggtggtcgt 
cggcggcggg 
gcgccgccaa 
cggtccgcgg 
ctgcaggccg 
ctctccgttc 
acactgctgc 
ggggagcgcc 
ccggcggtgt 
ccgcggtggg 
cggcgcgcgg 
ggcaagcgcc 
cgcgcctcgt 



catgccgacc 
gttggtgacc 
gggtgcgtgc 
cgtggtggtc 
gccgccctgc 
cggtgggccg 
ctcggtgccg 
cgcgagcacc 
gtcgtgcagc 
gcgggcccgg 
gaggatgttc 
gctgagggcg 
ggcccgcacc 
ggccggcatg 
cgcggcgcac 
ggccgcgccg 
gacgacgacg 
cctgtccgcc 
accggttgcg 
ggggtcaaaa 
ggcgctcccc 
agggcctgac 
cctccggacg 
agctgtgctg 
cggacgcggg 
acggctcgtt 
accggcggct 
accggctctc 
cggagctgct 
tcctgcgcgg 



agaccgtacc 
tcgaccctga 
ggcgcgtgcc 
acctcgatcg 
cgggagaagc 
tcgtcggcgt 
gccgtttcga 
tgggcggcct 
tcccgggcca 
ccgtccaccg 
agccaggtca 
accacgccga 
gccgagccgg 
tgcagcacgg 
ggcaccgggc 
ccgatcaacc 
gcggccacga 
gtccgcacgg 
gccgcgatgt 
gtagggtgac 
gacgatgaag 
gaaacgctac 
ggtgatcgcg 
cgggatggtc 
cgcggcgcac 
cacggtggaa 
ggccgaccgg 
cgccggccag 
cgtgctcgac 
cgtgctcgac 



74340 
74400 
74460 
74520 
74580 
74640 
74700 
74760 
74820 
74880 
74940 
75000 
75060 
75120 
75180 
75240 
75300 
75360 
75420 
75480 
75540 
75600 
75660 
75720 
75780 
75840 
75900 
75960 
76020 
76080 
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ttcgtcgccg agatcgacgc gagcgtgctg atctccggtc acccgtccgg agaggtggag 76140 

cggctctgcg accacctgat cgtgctgtcc gactcccggg tgctcgtcgc cggcgacgtc 76200 

cgggacctgc tcgcccggca ccaccgcatc atcgcgccgc gcggcgagct ggaccgcctg 76260 

ccgccgggga tggagcccat ctgggtggag gacttcggct cgtacagcgg gggagtggtg 76320 

cgggccgagg tggacctgcc ccggcggccg tggacggtgg agcgggtcga gctcgaggag 76380 

ctggtgctca gctatctgag ccgggcctcg ggcgcgcccg cgctcgccgg ctgcctgatc 76440 

gcgcccggtc agccggggag ctgacgggcg ctcaggtcca gcagccggct ggtgcgtacc 76500 

cggtcggcgg ccatctcgaa gagctggtgc tccacgtgct cgcgctcggc cacgctcagc 76560 

acggcctcgc cgccgagccg gccggccagc ccgtccagga cggccagcag cgccggccgg 76620 

tggagccggc caccggcgcg ccacaccccg agggcgcagg ccgcggccag cagcacggtg 7 6680 

taccggtcgg ccagcgcgaa ggcggccggg ttggcgtcga tcgtgatgtc ggccggcccc 76740 

agctcccggc aggcgtcccg caggccggtc agctcccggt ggaaccgctc ggcgaaccgg 7 6800 

ccgaccgggc cctcgtcgtg ccacgagtcg tgcaggacgc cggccagcgg gtcgccacgc 76860 

atgccgctga ccagccggga gaagtccaac ggggacagct cgccgccgag ggtgaacacg 76920 

ttgtccggcg gcggctcctc ggccgtccac gaccggcggg ccagccgggg caggtgcggc 76980 

agcagcatca ccaggcaggc cgcgcgggac acgtgcgcga aggaggcggg cgcggcgtcg 77040 

cggaccagct tctggaacat cgcgtacggg ccctgccgca gataaccctg cgcgcccagc 77100 

accgagcgca gatcgtcgat cgcgtccagc acgatctgcg cggtcaggta cttcacggcg 77160 

ggcgcgtaac cggccgtggc ctccggcagc aggtgcagcg cccgcaggcc cacgcccgag 77220 

aagacgtcga cggtgagcag ggcagcatag gcgcgcgcga tggtggtgcg cacgtacggc 77280 

aggtcggcga cggcggcccc gtacagccgg cgctccatcg ccatctcggt ggccagccgc 77340 

accgcgctgt ccagcgggcc caccagcagg gccggcgaca ccatccgggt gacctgatag 77400 

gcccgcagcg ccacctcgat gccgtgtccc tgttcgccca acagcgcgga ggacggcacc 77460 

gggcagtcgt cgaagaccag cccgcccagg tcgatgccgc gcatcccgct gcccggatac 77520 

cgcgggcggt ccaccgcccg cgcggccggc agctcgtcgc gcaccagcag gaactggctg 77580 

tgcgaccggc tgccgcgcgc ctcgccggtg cgggcgaaca gcaccatcgc ctcggcccgg 77640 

cgcaggttcg tcacgatctc cttgtgcccg gacagcaccc atccgccgcc ggccgggcgg 77700 

gccgcgcact cggcggcgga gaagtcggtg ccgtgcgega gctcgtggaa cgcggccgcg 77760 

atgcgcttgt tggccagcag caggccggcc gcccggcgcc gttgctcctc gttgccggcg 77820 

caccacacgt tcaccgaggc gatgagcgag ctgaagccgt agcccagccc caggcacggg 77880 
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tcgcgccgcc agaccgcgcg cagcacctcg gccagccggt cggcgcgggc cagccggccg 77940 

ccgtacgcca ccggcacgaa ctcggcgttg agctgatagc ggtcgagcag ccgctcgccc 78000 

tcggccagca tctcctgccg ctcgtcggcg gcgagcacgg ccgcatagcc gaccgggttg 78060 

ccggggtcgc gggcgtcgcc gagcagcttc tcgagcgccg cggcggtcac cggccgtcct 78120 

ccgcccggac ggtgccggcg ggcggccgca gggccgtgcc gaacggggtc accggttcac 78180 

cggtgcgtgc ggccgccgcg acccgcgcgg ccagcagccg gtcggcgccg tcgtcgtgcg 78240 

ggccgggatc gggcgccggc gcccgcagca cccgggccag ctcccggcgc acggcacgca 783 00 

gcgcggccag cacccacagg ccgtccgccc agagcgggtc cgcggtgtgc cggccctccg 78360 

ccgaggtcca cagcagcagg cacgccgcgc ccgcgtagca ccactcgtag gcggcggcca 78420 

gctcgtggcc caccatggac ggccgggccg agggcccgag ctcggtcatc tgcccgcaga 78480 

tccgccgcgc ctcgccggcc agcgcggccg cgtgccgggc cagccccgcc gggccacccc 78540 

ggacggcggc cgacacggcg agcgccggca gcgcctgcac caccgagcac ccgtggcggg 78600 

agagcagggt cagggcgccg cggtccagcg gcggtggcgg ctgaccggcc gcgctcgcct 78660 

cggccagccc ctcggcggag acggcgccgg ccgcgaagcc ggccgcgagc cgcgggaact 78720 

ggtgcgccag cgccgtgcgg acgaccgggg tgctgccgtc gaacaccgcg accacgtggt 78780 

ggtcgcgcac cagcttgggg aacatgccgt gctcgtactc gtcacgcagg aaggaccgcg 78840 

agccgagcag ttcggccagc tcacgcagca cgcggtccac caccgtgggc acgtacgcct 78900 

tgacgatcgc cgacgtcacg ctcatctcgg ccgtgaggct gtgcaccgag cgcgtgccga 78960 

cgacggcggt cgcctcggcc gcggcgagca gggcggcgca gcgcgccaga atcccggccg 79020 

ggtggccccg gtcgagcagc ggccgccgca tgatcatccg ctgggccacg aagcgggcgg 79080 

tcagccgcag cgcgcggtcg ccggcgccca gcgacagccc ggcgcacatg gtgcgggtca 79140 

gctgcagcga gcgcagcacc gtctccagcc cgcttccggc ccgcccgagc agggccgtgc 79200 

cgggcaggcc ggcgtcctcg aacgcgatgc cggagatgtc gatgccccgc accccgtgcg 79260 

tggccacctt gggccggggc agccacgtgc cgggggcgag cgcctccttg tcgaccagga 79320 

acaggctctg cccgcgggcg tcgccggccc ggccggtgcg ggccagcacg gtgaggtatc 79380 

gggcccgggt cgcgttgttg atcggccact tgagaccgcg cagccggtag ccggcgtcgt 79440 

ggggcagggc cgtggtcgtg ccgtgcagca ggtcggcgcc gtggtcgggc tcggagagcg 79500 

cccatgcgac cggggtgccg gcgagcaccg cggcggcaag cgtcgcgcgc tggccgtcgt 79560 

caccggccag ccagaccggc gccgacccga ggtaggtctt gccgtgcgcg acggccgcgc 79620 

tcaggtcgcg ccgggcgacc gcacgccaca ggtgcaggag ccgctcgtgg tcgccggggg 79680 
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caccgcccca ctccggcggt acgtaccagg attgcagacc gaaggagttc agccggtcgt 79740 

gcagccgggc cgggaactcc tcggcgatgt cgcgggcggc cgtctcggcg tcccatgcgt 79800 

cctcggggcc gagcaggtcg tccaggcgca cctccgcggg tggcgccagc gggcgtacgg 79860 

tcatcggccc gcgcccgcct gcaccggccg cagcgccggc tcgagctcgg cgtgcagcgc 79920 

cgtgatgccg ccggccagga accgctcgcg gaccgcggtc cggcggatct tgccgctcgt 79980 

ggtccggcgc accgtgccgc gccgcaccag cagcacgttg cgtaccggca cgccgaagga 80040 

gacggtgagc cggcggctga cggcgctcgc cacccgcggc agctcgtcga gcggcgtgcg 80100 

cggatgcacc tcctgcacca gcacgatccg ctcgtcgggc gccggcaccc cgaacgccgc 80160 

cccgatctgg tggtccaccg cgtcgtgcgc ggcccgggcc tcccgctcga ggtcgtgcgg 80220 

ggccaggttg cggccgtgca cgatgagcag ctccttgagg cgtccggtga ggaacagctc 80280 

gcctccggtc agcgcaccca ggtcacccgt tcgcacccag ccgccgtcct ggccgtcgcc 80340 

cgcgggcctg gcgtcgaaga tgccggggtt gagctcggtc ctgccccagt agccggcgcc 80400 

cgcgccgggc ccgcgcagcc agatctcgcc gacccggccc tcgggcagcg gccgtagccc 80460 

gtccgggtcg acgatgagca cctcgaagtc gggcacccgg ccgacacccg gtatctcccg 80520 

ggccgcggcg gggtcggccg ggcgcagggc cggcgacgcg gcgtcctcca gcgcgcgagg 80580 

gtcggcggga agaaagaccg ccggcgcctc gaagaccttc gtcgacacgt acgcggtgaa 80640 

ctcggccatg ccgtagcagg ggttcaccgc gtgggtgtgc aggccgaagg gggcgaagcg 80700 

ctcggtgaac gcccggacgg tggccgggtt gaccggctcc gatccgttgt agagcgtgcg 80760 

gatgcgggac aggtcgaggc cggcgatctg ctcgtcccgc acggcgcgca cgcacaggtc 80820 

gtacgcgaag ttgggcgcgg cggagatggt gacccggtag cggtccatca tccgcagcca 80880 

gtccgccggc cggcggacga aggccgtcgg cgacatcagc acgacgccgg cgccgttcag 80940 

caggcccgcg gtgagcatgg cgaacaggcc catgtcgtgg tgcagcggca gccagctgcc 81000 

gaacacgtcg tcgctgttgt gcccgctgct gcggtcgaac gcgcggaggt tggccagcac 81060 

ggcccggtgg ctgagcatca cgcccttggg cgagccggtg gagcccgagc tgtactgcag 81120 

gacggccagc gagtccggcc ggggcccgcg gggcgggcgg atcgcttccg ccgcgagggg 81180 

cggaagcgca ccgaccggca ggccggacag gccgcgctcg cgcaggacgg cggtgagcgg 81240 

cgccgcgtcg tcggcgttga cgacggccag ggcgggggag cagtcggccg cgatgccgac 81300 

ggtgcgctcg gaggcgccgg acgaccctcc gggcgggggc gccggcacgg cgaccaggcc 813 60 

ggcgtagagg cacgccaggt agagctcgac gaactcggcg ccggtgggca gggcgatgag 81420 

gatgcgctcc ccggccggga accgggcggc gagccaggcc gcccgcgccg cggcgcggcg 81480 
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gtcgagctcg gcgtacgtga gggtgacggg ctcggcgtcc gggtcgccgg ggagcaccac 81540 

gagggcgggc tcgccggggc gcgcgatcgc ctgcgcccgg aaggcgtcgg gaacggtggg 81600 

ttgggtggcg gcgtcgatga ccatggcaac tgcctctccg gaagggtccg ccagggccgc 81660 

gacgccgccg gcgggtcgcg ctcagctctg cggttgcttc tgcttctcca tcgccgcgat 81720 

caggctcttg ggccgcatgt cggtccagga cttctcgatg tagtcgaggc agtcctgccg 81780 

tccggcctcg ccgaacaccg tcgtccagcc gtcgggcacg tcggcgaaga ccggccagag 81840 

tgagtgctgt ccctcgtcgt tgaccaggac caggtagttg gcgtcgggat cttcaaacgg 81900 

attgggcaca gcgtcgcctc catcgaacgg atcggatgac ggtggtcagg agccgcacgg 81960 

ccggcggttg gcgccaccgc cagcactatg gccgagactc ggaacggccg acaagcgtgc 82020 

acgggctgtg cgactgcgcc gccggcgcgg ttggacagcg gtcagagcgg gccgcggccg 82080 

gcccgcagga gcatgagcgc gatggtcgtg ccatcctcgc cgagcgcggc gcggaagcgc 82140 

tggatgacct cctgctcccg ggagagcacc acccgggttc cgccggaggc cagccggatg 82200 

gagccgatct tctgcgagat cgaggccctt tcctgccaca gccgcatgat ggcggcgtcg 82260 

atctcgtcga tgcggttccg caggaccggg atgtccgcct ccggcacggc ctcgccgtgc 82320 

tcggccggcg gtgcgcccgg cgccacgacc tgaatgacgt tctccaccat cgccgtctcc 82380 

tgtgagcttg tcggtggtga ctcggggtgg gcagccggga cgttaacacc gccgcgcggc 82440 

cgtccgggcg cgtgcgactg aggccgtccg tcagtcgaac gcacgctttc gtgcgcactg 82500 

cccaaccagg ccgaacggct cctacagtcg ccgtcgatcg cagtcggtgt aacaccgtcg 82560 

aaacgccgga ggatgccttg gccgccgtgg atgtcccgag ggtgcgcccg cccggtgccg 82620 

cgcccgcgcc gcggcgtcgc cggtggcggt tctggcagtc gccggacggc cagccggcgt 82680 

gggcccgccc ggcgctgctg ggcatcgcgg cgctggcggc cgtgctgtac acggcgaacc 82740 

tcgcccgcag cggctacccc atgtactacg ccgtggcggt gaagagcatg tcggtgagct 82800 

ggccggcgtt ctggaccggc gcgttcgacc cggccgcctc gatcacgatc gacaagctcg 82860 

ccggcgcctt cgtcccgcag gcgctctccg cccgcgtctt cggcttccac cagtggtccc 82920 

tggccctgcc gcaggccgtc gagggggtca tcgccgtgct ggtcctctac cgggcggtgc 82980 

ggcgctggca cgggcccggc gccgggctgg ccgcggccgg gctgttcgcc acgacgccga 83040 

tcgtgtcctc gatgttcggc cactccatgg aggacggtgc gctgacgctc tgcctggtgc 83100 

tcgcggccga cgcgttcggc gcggcggtga cccgcggcag cccggcccgg ctggcgctcg 83160 

cgggcgcctg gatcgggctc ggcttccagg cgaagatgat gcaggcgtgg ctggtgctgc 83220 

cggccctggt cgtcacctat ctcgccggcg caccggtgcg ggcgcgggcc cgggtcgtcc 83280 
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atgtcgcggc ggcggtggcg gcgaccctgg cggtctcgct gctctgggtg ctggcgctga 83340 

ccctgctgcc cggctcgcac cggccgtggg cggacggcac cacctccggc aacgccttcg 83400 

ccatggtgtt cggctacaac ggtttcgacc gggccggcat ccacgtgccg ggcgcgctga 83460 

cgaccggctt caccgacggc ggggccgcgg ccggcggttc ctggacggcg cttgccgcgg 83520 

atcgcctcgc cacccagatc gggtggtggt acccgctggc gctgaccggc ctgctgctcg 83580 

gcctggcccg gtggcgcacc gcgcgcgccg gcctcctgtt ctggggactg tggttgctga 83 640 

cggccgcggt ggtgctcagc cggatcacca ttcagcacaa cgcctacctg gccgtgctgg 83700 

ccccgccgct ggcggcgctc gcggcggccg gcgcggtgca gctctggcgc acgcaccgcg 83760 

acggcacggc gccctggctg ctgcccgcgg tcgtggtcgt ccaggccggc tggaccctgt 83820 

ggctggccac ccgctatccc tcgttcctgg ccgggctgac gtggacggcg ccgatcgccg 83880 

ccgtcctggc cgtggtggtg ctggccgcgc ggccgacggc ccggcggccg gccgtcgtcg 83940 

tggtggtcgc cggcctgctg gcggtgccgg tcgcgtgggg cgcctcggtg ctgaacccgc 84000 

gatacgccgg cacgtcgttc gaggccggtg cggggccgag cgggccggtg ggcgtgcggc 84060 

tcgacgacga caccaccgac cggctgacgc cgggcctgcg caggctcgac gactatctcg 84120 

cggcccaccg cgacggccgc acctacctgg cggccacgtc ctcgtggcgc acggccggcc 84180 

ggctcatcgt cccgaccggg cactcctacc tgccgctcgg cggcttcagc ggagcggcgc 84240 

cgttcccgtc gctggccggc gtgcagcgcc tggtccgcga cggcgagctg cgctacttcg 84300 

tcctcggcgg cccggagggc ctcggcggcg aggccaccga ggcgtaccgc atcaccggct 84360 

gggtcctcga gacgtgcgcc accgtgccgc cggccgagca cggcgccgat ccggatctca 84420 

cggtcctgcg atgcgacaag ccctgacaac cgacgtagca cgacgaaccc gggaggaaca 84480 

gtggacaacg gcaccttcac cgatctgcgc atcgaccaca tcgaattcgc ggtcgcggac 84540 

gtcgaatccg ccagcgcccc gtttacggag ggctacggct tctcggtgta cggcgggacc 84600 

ggcgacgcgc acgcgccggt gcggcgggtg gcgctgggac gcgacgacat ccggctcgtg 84660 

ctgaccgcgg cgcccggcgg ggaccatccg gccatggcct acgtcgagca gcacggcgac 84720 

ggtgtgtcgg ccatcgcgct cagcacgaga gacgcccacg cggcgttcac cgaggcggtg 84780 

cggcgaggcg ccgtcggggt atccgccccg gtcaccggca acggcgtgac cgtcgcgacc 84840 

atccgcggct tcggtgacgt cctgcacacc ttcgtcgagc gggcgccggg cgcggacccc 84900 

cggaccctgc ccggcctgga gctgcggcgg cccagcccca cccggttcga ctcgggcctg 84960 

caggcgatcg accacatcgc cgtctgcctc gagccgggga ccctcgaccc gaccgtcgac 85020 

ttctaccgcg acgtcctcga cttcgagatg atcttcgagg agcgcatcct ggtgggccgg 85080 
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caggcgatgg actccaaggt ggtgcagagc cgctcgggcg gtgtgacgct caccctcatc 85140 

gagcccgaca cgtcgctcga gcagggccag atcgacacct tcctgaagaa ccacggcggg 85200 

ccgggcgtgc agcacctcgc gttcatcacc gacgacgtgc tgcgctcggt cggccggatg 85260 

tccgagcacg gcgtcgagtt cctgcacacc ccggactcgt actacggccg gctaccgggg 85320 

cgcatcccgc aggccgggca cccgatccag gcgctgcgcg acctgaacgt gctcgtcgac 85380 

caggaccacg acgggcagct gttccagatc ttcacgaagt cggtgcaccc ccgcgggacg 85440 

atcttcatgg aggtgatcga gcgaatgggc gctcgcagct tcggcagcgg caacatcaag 85500 

gcgctgtacg aggcggtcga gctcgacatg tccaagcaga gcgcctgagc cgccgatgga 85560 

gtcgccggca acccacgcgg aactggtgat cgggaccgtc ctgctcgaca tcgcgctggt 85620 

gctcgcggcc ggtgccctgc tcggccggtg ggtgcggcgg ctgcgccagc ccgcggtgat 85680 

cggggagatc ctcgcgggca tcgcgctcgg cccgagcctg ctcggcctgc tgccgggcaa 85740 

cccgacggcc tggctcttcc cggccgaggc ccggccgtac ctgtccgcgg tggcccagat 85800 

cggcctggcc ctgttcacgt tcctgatcgg ctgggagttc aacccggcga ctctggcccg 85860 

gcaccgcggc accgccgccg cggtgtcgat cggctccatc gcggtctcgt tcggcctcgg 85920 

catcgcgctg gccacggtgc tgcatccccg gcacgacacg accgggggcg ggaaggtcgg 85980 

cttcaccgag ttcgcgctct tcctgggcgt ggccatgtcg atcaccgcgt tcccggtgct 86040 

ggcccggatc ctcgcggagc ggcgcctcac cggcacgcgg gtgggcagca tcgcgctggt 86100 

cagcgccgcg atcgacgacg tggtggcctg gtgcctgctg gccctggtga cggccatcgc 86160 

cacggcgagc gggccggtcc agctcgtacg catcctcgcc ctgctggccg tcttcctggt 86220 

ggtgctggtg acggtcgtac ggccgctgct ggtcttgctg gcgcggcggc cgtccgcgtc 86280 

gtatcttctg gtggcggtgg tcgcggtcgt gctgctctcg gcatatgcga ccacctggat 86340 

cggcctgcac gcgatcttcg gcgcgttctg cgccggcctg gtcatgcccc gggagccggc 86400 

ggcggcgctg cgtgagcggg tgcggcagcc gctggaacac gtaagcgtgg tgctgctgcc 86460 

ggtgttcttc atcgtcaccg gcctcggcgt cgacatcggc gcgctcaccg cggcgaacat 86520 

cctcgaactc gccgcgatca tcgtgatcgc ctgcgccggc aagctggccg gcgcgatcgt 86580 

gccggcggtg tcgctcggca tgtcgtggcg ggacgccaga accctcggcc tgctcgtcaa 86640 

cacgcgcggc ctgaccgagc tcgtcgtgct caacgtcggt ctgcagctcg ccgtgctgga 86700 

cggccagatg ttcacgatga tggtgctgat ggcgctggtg acgaccgccc ttgcgggccc 86760 

gctgatcgga tcggcccgga caccggcggc cggcgcaccg gctcaggcgc tcccggccga 86820 

accgcggacg cggcgggcgg cgtaggcccg gcgcatcttg atgtagttgc cgcatccggc 86880 
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catcgagcac cagcgccgcg cctggttgcg ggaggcgtcg tagtaggccc accggcaggt 86940 

gtcccggtcg cagaccttca gccgggtcca cacctgcagc tcggcgcact gccggaccgc 87000 

gtcgatcagc ccggccaggg cacggtcgaa cggcgtgccc ccggccggca gcaggcgcgg 87060 

cgcgccgccg gccaggctca gccgcacggg aacggcggcc aggatctcct ccaggcggcg 87120 

cagcgcggcc gggtcggcgg gatggccggc gtggcccagc agcacctggc gaaggccctc 87180 

acgaagcgcg acggcccgtg ccaggtcggc cgggcgcacc ctcgcgccgg gggcgagcag 87240 

gctttccccg gccagccacg cccgaagggc gtccggggtg ctcagcgact cgtcgtcgac 873 00 

ctgaggctca taggtgttga cgaaatcgcg caacagccgg gcaggggatg gcactgcggc 873 60 

ctcgtcactc accgttgacc tccttggtgg cgacagcgta gcgccttgac aggcgcgcga 87420 

accgcctgga acgatccgct ttcacaccag caaaactact ttggtggtgt gaagaggcgc 87480 

gaaggagcgg atcgtggcca ccacgctgag ggacgtggca cggctcgccc gggtgtcggt 87540 

gaagacggtc tccaacgtcg tcaacgacca cccgcacgtc agcgacgacg tgcgccgccg 87600 

ggtcgagacg gcgatccggc agctgggcta ccgtcccaac ctcgtcgccc gcgccctgcg 87660 

cagcggccgc ggcagcggac tgctcgccct ggcgatgccg ggcgccggcg cgccgcagtc 87720 

gcccgccctg atcgaggaga tcatccggcg ggcggccccg ctcggattcc gggtcctcat 87780 

cgagccgctc gagtcgtcac ggccgaggcc gccggccccc ggcgtcgacg cccggctgct 87840 

gaacgcggag gcgccggccc ccgagctggt ggacgcccag gcggccacgg gcaccccgct 87900 

ggtgctgctg accggcaccc ccgatccgcg atacgactgc gtcggcccgg acgccgcccg 87960 

cgcggccgag gacgcggtgg accacctgcg ccgtctcggc cggcgccgcg tcgccaccat 88020 

cggcggctcg ctctccaccg gtccggccgg ctccggctcc gacttcggtt ccggctccgg 88080 

ttccggctcc ggctccggtt ccggctccgg ctcgggctcc ggctcgggct cgggctccgg 88140 

ctcgggctcc ggcttcggct cgggctccgg cttcggctcg ggctccgcgg agggctaccg 88200 

ggccgcacgg cagttgttag gccacgaaga tcgcccggac gcgatcgtct gcggcagcgt 88260 

gcggctggcg gtcggcgtga tccgggccgc cgccgacgcc ggcctgcggg tgcccgagga 88320 

cgtcgccgtg atcggcatcg gcgacggcga ggagggccgc tacacgcggc cggccctgac 88380 

cacggtcgcc accgacccgg cgttcatcgc cggcaaggcg c 88421 



<210> 2 

<211> 333 

<212> PRT 

<213> Actinoplanes sp. 



<400> 2 
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Met Ser Trp Arg Gin Phe Arg Trp Gin Ala Leu Ala Gly Ala Val Ala 
15 10 15 

Leu Val Pro Leu Val Ala Tyr Leu lie Val Thr Ser Leu Asp lie Arg 
20 25 30 

Arg Ala His Asp Arg Tyr Gin Ala Gin Cys Ala Ser lie Gly Asn Cys 
35 40 45 

Ala Glu Ala Met Leu Gin Phe Gin Asn Asp Phe Arg Thr Arg Leu Leu 
50 55 60 

Leu Leu Ala He Leu Leu Ala Ala He Pro Gly He Leu Gly Val Phe 
65 70 75 80 

Trp Gly Ala Pro Leu Val Ala Arg Glu Leu Glu Thr Gly Thr His Arg 
85 90 95 

Leu Val Trp Asn Gin Ser Val Thr Arg Arg Arg Trp Leu Ala Val Lys 
100 105 110 

Val Leu Phe Val Gly Val Ala Ala Met Ala Val Ala Thr Leu Val Ser 
115 120 125 

Thr Leu Leu Thr Trp Ala Ser Ser Pro Val Asp Ala Val Ser Gin Asp 
130 135 140 

Arg Phe Gly Ala Leu Val Phe Asp Ala Arg Asn He Val Pro Val Ala 
145 150 155 160 

Tyr Ala Ala Phe Ala Leu Val Leu Gly Thr Val He Gly Leu Leu Val 
165 170 175 

Arg Arg Thr He Pro Ala Met Ala Leu Thr Met Leu Val Phe Ala Val 
180 185 190 

Val Gin Phe Thr Val Pro Ala Leu Ala Arg Pro His Leu Met Ala Pro 
195 200 205 

Glu Thr Gin Thr Arg Gin Met Thr Leu Gin Glu Phe Gly Glu Val Arg 
210 215 220 

Gly Phe Gly Asp Glu Pro Thr Val Asn Gly Leu Ser He Arg Gly Ala 
225 230 235 240 

Trp Val Thr Ser Thr Ser Pro Leu Leu Thr Ala Asp Gly Thr Arg Leu 
245 250 255 

Asp Lys Ala Thr Tyr Arg Lys Cys Val Thr Asp Pro Pro Ala Val Ser 
260 265 270 

Gly Gly Ala Pro Gly Val Gly Gly Thr Val Ala Cys Leu Ala Asp Leu 
275 280 285 

Asp Leu His Val Glu Val Ala Tyr Gin Pro Asn Asp Arg Tyr Trp Thr 
290 295 300 



Phe Gin Trp He Glu Ser Ala Leu Tyr Leu Ala Leu Gly Gly Leu Leu 
305 310 315 320 



54 



Leu Ala Val Gly Leu Trp Arg He Arg Arg His Val He 
325 330 

<210> 3 
<211> 304 
<212> PRT 

<213> Actinoplanes sp. 
<400> 3 

Met Pro His Glu Asp Ser Ser Pro Val Leu Gin Ala Glu Gly Leu Thr 
1 5 10 15 

Lys Arg Tyr Gly Arg Arg Thr Ala Leu Gin Asp Cys Asn Leu Thr He 
20 25 30 

Pro Arg Gly Arg Val He Gly Leu Val Gly Pro Asn Gly Ala Gly Lys 
35 40 45 

Ser Thr Leu Leu Gin Leu Ala Cys Gly Leu He Thr Pro Ser Glu Gly 
50 55 60 

Ser Leu Arg Val Leu Gly Glu Thr Pro Ala Ala Asn Ala Gly His Leu 
65 70 75 80 

Ala Lys Val Gly Phe Val Ala Gin Asp Thr Pro Val Tyr Ser Asn Phe 
85 90 • 95 

Thr Val Gly Asp His Leu Lys Met Gly Ala Lys Leu Asn Pro Thr Trp 
100 105 110 

Asp Gin Ala Leu Ala Glu Arg Arg Val Ala Gin Val Gly Leu Asn His 
115 120 125 

Gly Gin Lys Ala Gly Arg Leu Ser Gly Gly Gin Arg Ala Gin Leu Ala 
130 135 140 

Leu Thr Leu Ala Ala Ala Lys Arg Pro Glu Leu Leu Met Phe Asp Glu 
145 150 155 160 

Pro Ala Ala Ala Leu Asp Pro Leu Ala Arg Asp Gly Phe Leu Gin Asn 
165 170 175 

Leu Leu Glu Phe Val Thr Glu Leu Asp Ala Ser Ala He Leu Ser Ser 
180 185 190 

His Leu Leu Gly Asp Val Glu Arg Val Cys Asn Tyr Leu He Val Leu 
195 200 205 

Cys Ala Ser Arg Val Gin Val Ala Gly Asp Val Pro Asp Leu Leu Asn 
210 215 220 

Thr His Tyr Arg He Val Ala Pro Arg Gly Glu Leu Asp His Pro Pro 
225 230 235 240 

Ala Gly Leu Glu Val He Arg Ala Gin His Ala Asp Arg Tyr Thr Thr 
245 250 255 

Ala Val Val Arg Gly Asp Gly Ser Arg Pro Ser Thr Trp Thr He Glu 
260 265 270 
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Pro lie Gin Leu Glu Glu Leu Val Leu Ala Tyr Met Thr Arg Ala Met 
275 280 285 



Gly Val Thr Gly Glu Pro Leu Met Ala Ala Ser Gly Glu Val Val Arg 
290 295 300 

<210> 4 
<211> 336 
<212> -PRT 

<213> Actinoplanes sp. 
<400> 4 

Met Ser Trp Arg Gin Phe Arg Gly Gin Ala Val Val Gly Val Val Val 
15 10 15 

Leu Ala Leu Leu Ala Ala Tyr Leu Val Tyr Leu Gly Val Asp lie Arg 
20 25 30 

Gly Ala Tyr Asp Asp Tyr Arg Ala Gin Cys Pro Ala Gly Gly Asp Cys 
35 40 45 

Ala Gly Pro Leu Gly Gin Phe Ser Leu Asp Tyr Glu Asn Thr Leu Leu 
50 55 60 

Tyr Leu Ala Gly Val Leu Ala Leu Val Pro Gly Leu Leu Gly Met Phe 
65 70 75 80 

Trp Gly Ala Pro Leu lie Thr Arg Glu Leu Glu Asn Gly Thr Gin Arg 
85 90 95 

Leu Val Trp Asn Gin Ser Val Thr Arg Arg Arg Trp Leu Leu lie Lys 
100 105 110 

Leu Leu Val Val Gly Leu Ala Cys Met Val Val Ala Gly Val Pro Ser 
115 120 125 

Leu Leu Leu Thr Trp Ala Ala Ala Pro Val Asp Asn Val Ala Asp Asn 
130 135 140 

Arg Phe Ser Thr Val Met Phe Gly Ala Arg Phe Leu Pro Pro lie Ala 
145 150 155 160 

Tyr Ala Ala Phe Ala Phe Val Leu Gly Thr Leu lie Gly Leu Leu Val 
165 170 175 

Arg Arg Thr Val Pro Ala Met Ala Leu Thr Leu Val Ala Phe Val lie 
180 185 190 

Phe Gin Phe Leu Val Pro Asn Leu Val Arg Pro His Leu Met Pro Ala 
195 200 205 

Lys His Leu Val Lys Pro Met Thr Val Ser Ala lie Asn Glu Ala Lys 
210 215 220 

Ser Leu Gly Ser lie Thr Gly Ala Pro Val Leu Asn Gly Leu Ser lie 
225 230 235 240 



Ser Gin Gly Trp lie Thr Asp Val Ser Ala Leu Lys Thr Ala Asp Gly 

56 



245 



250 



255 



Arg Ser Leu Asp Ala Lys Thr Phe Asp Asn Cys Tyr Met Asn Ala Pro 
260 265 270 

Lys Thr Gly Ala Thr Glu Gly Pro Tyr Gly Asp Val Ala Val Cys Leu 
275 280 285 

Ala Lys Leu Asp Leu His Val Asp He Ala Tyr Gin Pro Trp Asn Arg 
290 295 300 

Tyr Trp Ala Phe Gin Phe Leu Glu Ser Gly Phe Tyr Val Leu Leu Ser 
305 310 315 320 

Gly Leu Leu He Gly Ala Ala Val Trp Arg Val Gin Arg Arg Pro Ser 
325 330 335 

<210> 5 
<211> 283 
<212> PRT 

<213> Actinoplanes sp . 
<220> 

< 2 2 1> mi sc_f ea ture 
<222> (1)..(1) 

<223> V represents a non-standard initiator codon. It is expected that 
the biosynthesized protein will contain a methionineresidue at t 
his position 



<400> 5 

Val Arg Ser Ala Val Val Val Gly Thr Gly Leu He Gly Thr Ser Val 
15 10 15 

Gly Leu Ala Leu Thr Gin Arg Asp He Thr Val His Leu Leu Asp Ala 
20 25 30 

Asp Pro Ala Ala Ala Arg Ala Ala Ala Ala Leu Gly Ala Gly He Ala 
35 40 45 

Gly Glu Pro Arg Thr Arg Val Asp Val Ala Val He Ala Val Pro Pro 
50 55 60 

Ala Ala Val Ala Pro Val Leu Ala Asp Leu Gin Arg Arg Gly Thr Ala 
65 70 75 80 

Arg Val His Thr Asp Ala Ala Ser Val Lys Val Leu Pro Ser Arg Gin 
85 90 95 

He Glu Val Leu Gly Cys Asp Ala Ser Ser His Val Gly Gly His Pro 
100 105 110 

Leu Ala Gly Ser Glu Arg Ser Gly Pro His Ala Ala Arg Gly Ser Leu 
115 120 125 



Phe Glu Gly Arg Pro Trp Val Leu Ser Pro Gly Arg Arg Ser Ser Thr 
130 135 140 



Ala Ala Val Asp Gly Ala Leu Ala Val Val Ser Ala Cys Gly Ala Thr 

57 



145 



150 



155 



160 



Pro Val Leu Met Ser Ala Glu Glu His Asp Arg Ala Val Ala Leu Val 
165 170 175 

Ser His Val Pro His Leu Val Ala Gly Leu Leu Ala Ala Arg Met Leu 
180 185 190 

Asp Gly Thr Pro Ala Gin Leu Gly Leu Ala Gly Gin Gly Val Arg Asp 
195 200 205 

Thr Thr Arg lie Ala Gly Gly Arg Ala Ala Leu Trp Thr Glu lie Leu 
210 215 220 

Ala Ala Asn Ala Gly Ala Val Ala Asp Val Leu Asp Asp Leu Ser Ala 
225 230 235 240 

Glu Leu Ala Ala Thr lie Ser Ala Leu Arg Glu Leu Glu Ala His Pro 
245 250 255 

Gly Arg Ala Glu Ala Leu Ala Ala Leu Thr Gly Met Leu Gin Arg Gly 
260 265 270 

Val Asp Gly Arg Asp Arg lie Ala Ala Ser Pro 
275 280 

<210> 6 
<211> 336 
<212> PRT 

<213> Actinoplanes sp. 
<400> 6 

Met Glu Ser Leu His lie Ala Ser Ala Arg His Glu Pro Asp Arg His 
15 10 15 

Asp Glu Thr Gin Met Asn Thr Pro Ser Met Met Arg Val Glu Trp Leu 
20 25 30 

Pro Val Asp Ser Leu Glu Met Leu Asp Ser Pro Arg Leu Ala Gly Glu 
35 40 45 

Asp Pro Arg His Thr Gin Met Leu Ala Ser Leu Asp Ala Glu Leu Pro 
50 55 60 

Pro lie He Val His Arg Ala Ser Met Arg Val He Asp Gly Ala His 
65 70 75 80 

Arg Leu Gly Ala Ala Arg Leu Arg Gly Asp Glu Leu He Lys Ala Ala 
85 90 95 

Met Phe Glu Gly Ser Glu Gin Glu Ala Phe Val Leu Gly Val Lys Ala 
100 105 110 

Asn He Ala His Gly Leu Pro Leu Ser Thr Ala Asp Arg Thr Arg Ala 
115 120 125 

Ala Glu Arg He He Glu Ser His Pro Ser Trp Ser Asp Arg Thr He 
130 135 140 



58 



Ala Ala Ser Ser Gly Leu Ser Ala Arg Thr Val Gly Asn lie Arg Arg 
145 150 155 , 160 

Arg Leu Glu Leu Ser Gly Asp lie Gly Gin Gly Ser Arg Thr Arg Val 
165 • 170 175 

Gly Arg Asp Gly Arg Val Arg Pro Leu Asp Asn Ser Glu Gly Arg Leu 
180 185 190 

Lys Ala Val Ser Tyr lie Gin Gin Gin Pro Asp Ala Ser Leu Arg Glu 
195 200 205 

lie Ala Lys Asn Ala Gly Val Ser Pro Ser Thr Ala Arg Asp Val Arg 
210 215 220 

Asn Arg Leu Gin Arg Gly Glu Asp Pro Leu Pro Gly Pro Arg Arg Thr 
225 230 235 240 

Gly Gly His Arg Asp Asp lie Ser Phe Asp Lys Glu Asn Thr lie Arg 
245 250 255 

Leu Leu Glu Pro Thr Val Arg Ser lie Leu Gin Gly Leu Lys Asn Asp 
260 265 270 

Pro Ser Leu Arg Phe Thr Glu Ser Gly Arg Asn Leu Leu Arg Trp Val 
275 280 285 

Leu Ala Arg Thr Val Gin Asp Asp Glu Trp Lys Asp Met Leu Asp Ala 
290 295 300 

Val Pro Ser His Cys Thr Tyr Val Leu Ala Asn Val Ala Arg Arg Cys 
305 310 315 320 

Ser Gin Glu Trp Leu Glu Phe Ala Glu Thr Leu Glu Lys Asn Ala Ala 
325 330 335 

<210> 7 
<211> 444 
<212> PRT 

<213> Actinoplanes sp. 
<400> 7 

Met Ser lie Leu Arg Glu Ala Pro Gly Thr Gly Arg Val Leu Arg Arg 
15 10 15 

Glu Asp Leu His Gin Ser Leu Ser Asp Pro Leu Leu Asp Thr Met Asn 
20 25 30 

Phe Leu Asn Glu Val Thr Ala Arg Tyr Pro Arg Ala Val Ser Phe Ala 
35 40 45 

Pro Gly Arg Pro Phe Asp Gly Phe Phe Asp Val Glu Gin lie Phe Arg 
50 55 60 

Gly lie Arg Gly Tyr Leu Glu His Leu Ala Gly Gin Gly Arg Ser Pro 
65 70 75 80 

Ala Glu lie Arg Asp Ala Val Phe Gin Tyr Gly Pro Ala Ala Gly Arg 
85 90 95 



59 



lie Arg Glu Val lie Ala Gin Trp Leu Arg Arg Asp Glu Gly lie Asp 
100 105 110 



Val Ala Pro Glu Ser He Val Val Thr Val Gly Ala Gin Glu Ala Met 
115 120 125 

Leu Leu Ala Leu Arg Ala Leu He Arg Asp Glu Arg Asp Ala Leu Phe 
130 135 140 

Val Ala Ser Pro Cys Tyr Val Gly He Thr Gly Ala Ala Arg Leu Leu 
145 150 155 160 

Asp He Asp Pro Val Pro Val Ala Glu Arg Glu Asp Gly Phe His Pro 
165 170 175 

Glu Asp Leu Ala Arg Ala Val His Ala Glu Leu Ser Arg Gly Arg Arg 
180 185 190 

Pro Arg Ala Phe Tyr Val Val Pro Asp His Thr Asn Pro Ser Gly Ala 
195 200 205 

Thr Met Pro Leu Glu Ala Arg His Ala Leu Leu Asp Leu Ala Gly Glu 
210 215 220 

Leu Gly Leu Leu Val lie Glu Asp Ser Pro Tyr Arg Leu Val Ser Pro 
225 230 235 240 

Gly Gin Gin Leu Pro Ser Leu Lys Ala Leu Asp Pro Gly Arg His Val 
245 250 255 

Val His Leu Gly Ser Phe Ser Lys Thr Leu Phe Pro Gly Ala Arg Val 
260 265 270 

Gly Phe Ala He Ala Asp Gin Pro Val Ser Asp Ala Ala Gly Gly Ala 
275 280 285 

Gly Leu Leu Ala Asp Glu Leu Ala Lys Val Lys Ser Met Val Thr Val 
290 . 295 300 

Asn Thr Ser Pro Leu Ser Gin Ala Ala Val Ala Gly Met Leu Leu Ala 
305 310 315 320 

Ala Gly Gly Thr Ala Ala Glu Ala Ser Ala Glu Ser Ser Ala His Tyr 
325 330 335 

Gly Ala Ala Met Arg Arg Thr Leu Asp Arg Leu Glu Glu His Leu Pro 
340 345 350 

Ala Ser Phe Arg Ala Arg Thr Gly Val Arg Trp Asn Arg Pro Ser Gly 
355 360 365 

Gly Phe Phe Leu Ala Val Asn Val Pro Phe Thr Ala Asp Asn Ala Ala 
370 375 380 

Leu Ser Arg Ser Ala Glu Asp His Gly Val He Trp Thr Pro Met Ser 
385 390 395 400 

Tyr Phe Tyr Pro Ala Gly Gly Gly Glu Gin Gly lie Arg Leu Ser lie 
405 410 415 



60 



Ser Tyr Leu Thr Pro Glu Glu lie Asp Glu Gly Val Lys Arg Leu Ala 
420 425 430 



Gly Phe lie Thr Thr Glu lie Ala Ala Leu Arg Pro 
435 440 

<210> 8 
<211> 356 
<212> PRT 

<213> Actinoplanes sp. 
<220> 

<221> misc_f eature 
<222> (1)..(1) 

<223> V represents a non-standard initiator codon. It is expected that 
the biosynthesized protein will have a f ormylmethionine residue 
at this position 



<400> 8 

Val Thr Ala Thr Ala Leu Leu Pro Leu Thr Leu Ala Asp Tyr Glu Gin 
15 10 15 

Leu Ala Gin Ala Arg Met Glu Pro Pro Val Trp Asp Phe lie Ala Gly 
20 25 30 

Gly Ala Gly Glu' Glu Leu Thr Leu Ala Ala Asn Thr Ala Ala Phe Ala 
35 40 45 

Pro Pro Arg Leu Arg Pro Arg Val Leu Thr Gly Ala Gly Ala Pro Asp 
50 55 60 

Thr Gly Thr Thr lie Leu Gly Arg Arg Trp Ala Ala Pro lie Gly Val 
65 70 75 80 

Ala Pro Leu Gly Tyr His Thr Leu Val Asp Pro Ala Gly Glu Val Ala 
85 90 95 

Thr Ala Ala Ala Ala Gly Ala Ala Gly Leu Pro Leu Val Val Ser Thr 
100 105 110 

Phe Ser Gly Arg Thr Val Glu Asp lie Ala Ala Ala Thr Thr Ala Pro 
115 120 125 

Arg Trp Leu Gin Val Tyr Cys Phe Arg Asp Arg Ala Val Thr Ala Ala 
130 135 140 

Leu Val Thr Arg Ala Val Arg Ala Gly Phe Glu Ala Leu Val Leu Thr 
145 150 155 160 

Val Asp Ala Pro Arg Leu Gly Arg Arg Leu Arg Asp lie Arg Asn Asp 
165 170 175 

Phe Arg Leu Pro Pro Gly Val Ala Pro Ala Asn Leu Thr Gly Asp Gly 
180 185 190 

Phe Ala Ser Pro Ser Gly His Ala Leu Gly Ala Phe Asp Ala Ala Met 
195 200 205 



61 



Asp Trp Thr Val Val Ala Trp Leu Arg Glu Leu Ser Gly Leu Pro Val 
210 215 220 



Leu Leu Lys Gly Val Leu Thr Ala Asp Gly Ala Arg Arg Ala Leu Asp 
225 230 235 240 

Ala Gly Ala Asp Gly He Val Val Ser Asn His Gly Gly Arg Gin Leu 
245 250 255 

Asp Gly Val Pro Ala Thr Leu Asp Val Leu Pro Glu Val Val Ala Ala 
260 265 270 

Val Ala Gly Arg Cys Pro Val Leu Leu Asp Gly Gly Val Arg Arg Gly 
275 280 285 

Arg Asp Val Leu Leu Ser Leu Ala Leu Gly Ala Asp Ala Val Leu Val 
290 295 300 

Gly Arg Pro Val Leu Tyr Gly Leu Ala Val Gly Gly Thr Ala Gly Val 
305 310 315 320 

Arg His Val Leu Asp He Leu Ala Gly Glu Leu Thr Asp Asp Met Ala 
325 330 335 

Leu Ala Gly Val Ala Ser Pro Ala Asp Ala Gly Ala Asp Leu Ala Gly 
340 345 350 

Pro Val Ala Pro 
355 

<210> 9 
<211> 640 
<212> PRT 

<213> Actinoplanes sp. 
<220> 

<221> misc_f eature 
<222> (1)..(1) 

<223> V represents a non-standard initiator codon. It is expected that 
the biosynthesized protein will have a f ormylmethionine at this 
position 



<400> 9 

Val Ala Thr He Asp Gly Pro Asp Leu Gly Val He Gly Leu Arg Val 
15 10 15 

Asp Gly Leu lie Pro Met Gin Lys Val Arg Pro Gly Thr Val Arg Arg 
20 25 30 

He Leu Pro Tyr Ala Lys Lys His Arg Trp Ser Leu Ala Val Ala Leu 
35 40 45 

Leu Met Thr Val Val Asp Ala Ala Leu Thr Val Ala Asn Pro Leu Leu 
50 55 60 

Leu Lys Gin He He Asp Arg Gly He Val Ala Gly Arg Leu Asp Val 
65 70 75 80 



62 



Val Val Gly Leu Ser Leu Val Val Ala Gly Leu Ala Leu Val Asn Val 
85 90 95 

Ala Ala He His Val Gin Thr Leu Ala Ser Gly Arg Val Gly Gin Gly 
100 105 110 

Leu He Tyr Asp Leu Arg Thr Lys Val Phe Ala His Val Met Arg Gin 
115 120 125 

Pro Leu Ala Phe Phe Thr Arg Ala Gin Thr Gly Ser Leu Val Ser Arg 
130 135 140 

Leu Asn Thr Asp Val Val Gly Ala Glu Gin Ala Met Thr Ser Met He 
145 150 155 160 

Thr Gin Thr Val Ser Thr Val Leu Thr Val Val Leu Val He Gly Ala 
165 170 175 

Met Phe Tyr Leu Ser Trp Ala He Ala Leu Val Ala Leu Val Leu He 
180 185 190 

Pro Leu Phe Phe Leu Pro Gly Lys Leu He Ala Gly Arg Leu Glu Arg 
195 200 205 

Leu Ala Arg Gly Gly Met Gin Val Asp Ala Glu Leu Gly Ser Met Met 
210 215 220 

Asn Glu Arg Phe Asn Val Ser Gly Ala Met Leu Val Lys Leu Tyr Gly 
225 230 235 240 

Arg Pro Glu Ser Glu Glu Thr Ala Phe Ala Gly Arg Ala Ala Arg Val 
245 250 255 

Arg Asp He Ala He Ser Met Gly Val His Ala Arg Leu Leu Phe He 
260 265 270 

He Ala Thr Leu Leu Thr Thr Val Thr Thr Ala Met Val Tyr Gly Phe 
275 280 285 

Gly Gly Ala Leu Val He Asp Gly Thr Leu Gly He Gly Thr Leu Val 
290 295 300 

Ala Met Val Ala Leu Leu Ala Gin Leu Tyr Gly Pro Val Asn Gin Leu 
305 310 315 320 

Thr Asn He Gin Val Asp Val Val Thr Ala Leu Val Ser Phe Asp Arg 
325 330 335 

Val Phe Glu Val Leu Asp Leu Asp Pro Leu Val Lys Glu Arg Pro Gly 
340 345 350 

Ala Arg Ala Leu Pro Ala Ala Glu Pro Gly Arg Ser Ala Ala Pro Asp 
355 360 365 

He Glu Phe Asp Asn Val Val Phe Arg Tyr Pro Gly Ala Asp Glu Val 
370 375 380 

Ser Leu Ala Ser Leu Glu Thr Val Ala Gin Arg Ser Ser Asp Gly Thr 
385 390 395 400 



63 



Ala Glu Arg Pro Val Leu Asn Gly lie Ser Phe Leu Ala Pro Ala Gly 
405 410 415 



Lys Leu Thr Ala Leu Val Gly Pro Ser Gly Ala Gly Lys Thr Thr lie 
420 425 430 

Thr His Leu Val Pro Arg Leu Tyr Asp Thr Thr Ser Gly Thr Val Arg 
435 440 445 

lie Ala Gly His Asp Val Arg Asp Leu Thr Leu Arg Ser Leu Ser Glu 
450 455 460 

Ser lie Gly Val Val Thr Gin Asp Ala His Leu Phe His Asp Thr lie 
465 470 475 480. 

Arg Ala Asn Leu Leu Tyr Gly Arg Pro Asp Ala Gly Glu Arg Asp Leu 
485 490 495 

Val Ala Ala Cys Glu Ala Ala Arg lie Trp Glu Met Val Ser Ser Leu 
500 505 510 

Pro Asp Gly Leu Asp Thr Val Val Gly Asp Arg Gly Tyr Arg Leu Ser 
515 520 525 

Gly Gly Glu Lys Gin Arg Leu Ala Leu Ala Arg Leu Leu Leu Lys Ser 
530 535 540 

Pro Pro Val Val Val Leu Asp Glu Ala Thr Ala His Leu Asp Ser Glu 
545 550 555 560 

Ser Glu Ala Ala lie Gin Arg Ala Leu Asp Thr Ala Leu Ala Gly Arg 
565 570 575 

Thr Ser Leu Val lie Ala His Arg Leu Ala Thr lie Leu Asp Ala Asp 
580 585 590 

Gin lie Leu Val lie Asp Asp Gly Arg Val Val Glu Arg Gly Thr His 
595 600 605 

Asp Glu Leu lie Ala His Gly Gly Leu Tyr Ala Glu Leu Tyr Arg Thr 
610 615 620 

Gin Phe Ala Gly Gin Arg Thr Glu Glu Arg Gin Pro Ala Val Pro Ser 
625 630 635 640 

<210> 10 
<211> 271 
<212> PRT 

<213> Actihoplanes sp. 
<220> 

<221> inisc_f eature 
<222> (1)..(1) 

<223> V represents a non-standard initiator codon. It is expected that 
the biosynthesized protein will have a f ormylmethionine residue 
at this position 



<400> 10 



64 



Val Ser Ala Ala Gly Ser Gly Phe Val Thr Thr Asn Gly Val Arg Leu 
15 10 15 



Ala Tyr Arg Arg Ser Gly Ala Gly Glu Pro Val Leu Met lie Met Gly 
20 25 30 

Ser Gly Ser Ala Gly Gin Thr Trp Thr Val His Gin Thr Pro Ala Leu 
35 40 45 

His Glu Ala Gly Tyr Ser Thr Val Val Phe Asp Ser Arg Gly lie Pro 
50 55 60 

Pro Ser Asp Val Pro Ala Gly Lys Tyr Ser Leu Ala Asp Met Thr Ala 
65 70 75 80 

Asp Thr Arg Gly Leu lie Glu Ala Leu Asp Leu Ala Pro Cys Arg lie 
85 90 95 

Val Gly Thr Ser Leu Gly Ala Met lie Ala Gin Glu Leu Ala Val Asp 
100 105 110 

His Pro Glu Leu Val Arg Cys Ala Val Leu lie Ala Thr Leu Ala Arg 
115 120 125 

Pro Asp Ala Ala Arg Ala Ala Gin Asn Gin Ala Asp lie Asp Leu Leu 
130 135 140 

Glu Ser Gly Val Thr Leu Pro Ala Ala Tyr Glu Ala Ala Thr Ala Val 
145 150 155 160 

Phe Lys Met Phe Ser Pro Ala Thr Leu Asn Asp Asp Val Ala Val Arg 
165 170 175 

Glu Trp Leu Asp He Phe Glu Leu Ser Gly Thr Gly Val Ser Ala Gly 
180 185 190 

Gly Gin Ala Trp Ala Glu Leu Thr Gly Asp Arg Arg Ala Ala Leu Arg 
195 200 205 

Ser Val Thr Ala Pro Cys Arg Val He Ser Phe Ala Asp Asp Leu He 
210 215 220 

Thr Pro Pro His Leu Ala Ala Glu Val Ala Glu Ala He Pro Asp Cys 
225 230 235 240 

Asp Leu Val Glu He Ser Arg Cys Gly His Leu Gly Tyr Leu Glu Arg 
245 250 255 

Pro Asp Ala Val Asn Ala Ala He Leu Glu Phe Leu Asp Ser His 
260 265 270 

<210> 11 
<211> 529 
<212> PRT 

<213> Actinoplanes sp. 
<400> 11 

Met Gly Asn Ala Asp Gin Pro Arg Tyr Leu Arg Ser Asn Val He Ala 

65 



1 



5 



10 



15 



Glu Pro Leu Val Asp Arg Phe Tyr Ala Trp Leu His Thr Val Ala Pro 
20 25 30 

Val Pro Ala Ser Met Asn Leu Ala Phe Leu Gin Val Pro Leu Leu Glu 
35 40 45 

Ser Tyr Leu Gin Ser Pro Pro Val His Val Ala Ala Ser Thr Asn Pro 
50 55 60 

Lys Met Arg Gly Gly Tyr Phe Val Ala Val Glu Glu Ser Arg Ser Asp 
65 70 75 80 

Glu Val Ala Glu Leu Leu Lys Thr lie Lys Asn Glu Arg Ala Asp Met 
85 90 95 

Leu Gly Phe Ala Ala Ala Val Ala Glu Ala Glu Asp Leu lie Arg Glu 
100 105 110 

Asn Ala Val Gly Tyr Asp Leu Thr Pro Leu Tyr Pro Arg Leu Pro Ala 
115 120 125 

Ala Leu Asn Gly Leu Val Glu lie Ala Tyr Asp Thr Ser Asn Gin Pro 
130 135 140 

Ser Leu His Phe Leu Glu Pro Leu Leu Tyr Arg Ser Pro Ala Tyr Asp 
145 150 155 160 

Glu Arg Arg Gin Ser Val Gin Leu Ser Leu Asp Asp Gly Val Glu Arg 
165 170 175 

Pro Phe lie Leu Ser Thr Pro Arg Leu Pro Arg Ala Gly Val Leu Asp 
180 185 190 

Leu Pro Leu Pro Leu Arg His Pro Gly Leu Thr Glu Leu Phe Asp Ala 
195 200 205 

Arg Val Arg Pro Thr Ser Leu Asn Arg Leu Arg Glu Ala Leu Glu Leu 
210 215 220 

Asp Asp Ala Gly Ala Ala Ala Leu Asp Ala Leu Leu Thr Asp Glu Pro 
225 230 235 240 

Ser Leu Ser Pro Asp Arg His He Glu Ser Gly Gly Arg Val Arg Tyr 
245 250 255 

Tyr Gly His Ala Cys Val Val Met Gin Thr Glu Gin Ala Ala Val Val 
260 265 270 

Thr Asp. Pro Phe He Ser Thr Asp Asn Arg His Gly Asp Arg Tyr Thr 
275 280 285 

Leu Asp Asp Leu Pro Asp His He Asp Leu Val Leu He Thr His Gly 
290 295 300 

His Gin Asp His He Val Leu Glu Thr Leu Leu Gin Leu Arg Gly Arg 
305 310 315 320 



He Gly Thr Val Val Val Pro Arg Thr Ser Arg Gly Asn Leu Pro Asp 

66 



325 



330 



335 



Pro Ser lie Ala Leu Tyr Leu Arg Arg lie Gly Phe Thr Val Val Glu 
340 345 350 

Val Glu Glu Phe Asp Glu Val Pro Phe Pro Gly Gly Thr Val Thr" Ala 
355 360 365 

Thr Pro Phe Leu Gly Glu His Ala Asp Leu Asp lie Arg Gly Lys Ser 
370 375 380 

Thr Tyr Phe Val Arg Met Ala Gly Arg Thr lie Phe lie Gly Ala Asp 
385 390 395 400 

Ser Ser Gly lie Asp Pro Val Leu Tyr Arg Tyr lie Arg Asp His Val 
405 410 415 

Gly Gin Val Asp Met Ala Phe Leu Gly Met Glu Cys Asp Gly Ala Pro 
420 425 430 

Leu Asn Trp Leu Tyr Lys Gly Leu Leu Thr Lys Pro Val Asn Lys Lys 
435 440 - 445 

Met Ser Ala Ser Arg Arg Leu Ser Gly Ser Asn Ala Glu Gin Ala Gly 
450 455 460 

Ala lie Met Thr Glu Leu Gly Ala Thr Ala Gly Tyr lie Tyr Ala Met 
465 470 475 480 

Gly Glu Glu Ser Trp Gin Gly His Val Met Ala Thr Thr Tyr Asn Glu 
485 490 495 

Asp Thr Tyr Gin Leu Lys Gin lie Asp Glu Phe Leu Ala Trp Cys Ala 
500 505 510 

Asp Arg Gly Phe Thr Ala Glu His Leu Phe Asn Lys Arg Glu Trp Arg 
515 520 525 



Trp 



<210> 12 
<211> 90 
<212> PRT 

<213> Actinoplanes sp. 
<400> 12 

Met Ser Glu Thr Asp Leu Ser Ala Ala Arg His Thr Pro Glu Gin lie 
15 10 15 

Arg Ser Trp Leu lie Asp Arg lie Ala Tyr Tyr Val Met Leu Pro Thr 
20 25 30 

Gin Glu lie Glu Pro Asp Val Ser Leu Ala Glu Tyr Gly Leu Asp Ser 
35 40 45 

Val Tyr Ala Phe Ala Leu Cys Gly Glu lie Glu Asp Thr Leu Gly lie 
50 55 60 



67 



Pro lie Glu Pro Thr Leu Leu Trp Asp Val Asp Thr Val Ala Thr Leu 
65 70 75 80 



Thr Ala His Leu Ala Asp Arg Val Asn Arg 
85 90 

<210> 13 
<211> 1051 
<212> PRT 

<213> Actinoplanes sp. 
<220> 

<221> misc_f eature 
<222> (1)..<1) 

<223> V represents a non-standard codon. It is expected that the biosy 
nthesized protein will have a f ormylmethionine residue at this po 
sition 



<400> 13 

Val Pro Thr Pro Asp Leu Arg Pro Leu Thr Pro Ala Gin Leu Ala Val 
15 10 15 

Trp His Ala Gin Gin Leu Ala Pro His Ser Pro Val Tyr Gin Val Gly 
20 25 30 

Glu Phe Val Glu lie Asp Gly Glu Cys Asp Pro Asp Leu Leu Val Ala 
35 40 45 

Ala Leu Arg Gin Val Met Gly Glu Ala Glu Ser Ala Arg Leu Arg Phe 
50 55 60 

Arg Val lie Asp Gly Thr Pro Trp Gin Tyr Val Ala Glu Asp Gly Asp 
65 70 75 80 

Asp Pro lie Gin Val Val Asp Leu Gly Ala Ala Ala Asp Pro Arg Ala 
85 90 95 

Ala Ala Leu Gly Arg Met Ala Ala Asp Leu Asp Arg Pro Gly Asp Leu 
100 105 110 

Arg Asp Gly Pro Leu Val Glu His His Val Tyr Leu Leu Gly Glu Gly 
115 120 125 

Arg Val lie Trp Tyr His Arg Ala His His lie Val Cys Asp Gly Gly 
130 135 140 

Ser Leu Gly lie Val Ala Ser Arg Val Ala Gly Val Tyr Ser Ala Leu 
145 150 155 160 

Ala Ala Gly Gly Asp Val Arg Pro Gly Ala Leu Pro Pro Leu Ser Val 
165 170 175 

Leu Leu Ser Ala Ala Asp Ala Tyr Glu Arg Ser Gly Asp Arg Asp Arg 
180 185 190 

Asp Arg Glu His Trp Arg Ser Ala Leu Ala Gly Leu Pro Ala Glu Leu 
195 200 205 



68 



Leu Ala Gly Ala Gly Arg Pro Arg Pro Leu Pro Gly Pro Pro Val Arg 
210 215 220 

His Glu His Asp Leu Ser Ala Ala Glu Ala Gly Arg Leu Arg Ala Gly 
225 230 235 240 

Ala Arg Arg 'Leu Arg Thr Ser Val Ala Gin Ala Gly lie Ala Ala Ala 
245 250 255 

Ala Leu Tyr Gin His Arg Leu Thr Gly Ala Arg Asp Val Leu Val Ala 
260 265 ' 270 

Val Pro Val Ala Gly Arg Thr Thr Arg Pro Glu Phe Asp Val Pro Gly 
275 280 285 

Met Thr Ser Asn Val Val Pro Val Arg Leu Ala Val Thr Pro Ala Thr 
290 295 300 

Thr Val Gly Glu Leu Leu Arg Asp Val Ala Arg Gly Val Arg Asp Gly 
305 310 315 320 

Leu Arg His Gin Arg Tyr Pro Tyr Pro Asn lie Val Asp Asp Leu Gly 
325 330 335 

Leu Ala Asp Arg Ala Ala Leu Arg Pro Val Thr Val Asn Ala Leu Ala 
340 345 350 

Leu Gly Arg Pro Leu Arg Phe Gly Ser Ala Val Gly Val Arg Ser Gly 
355 360 365 

Leu Ser Ala Gly Pro Val Asp Asp Val Thr lie Gly Leu Tyr Glu Lys 
370 375 380 

Val Ser Gly Gly Gly Met Gin Thr lie Ala Glu Leu Asn Pro Gly Arg 
385 390 395 400 

Thr Asp Arg Pro Asp Ala Ala Glu Val Ser Arg Trp Phe Arg Thr Leu 
405 410 415 

Leu Arg Gly Leu Ala Glu Ser Asp Ala Gly Asp Pro Val Ala Arg lie 
420 425 430 

Asp lie Val Asp Glu Pro Glu Arg Arg Arg Leu Leu Asp Glu Trp Asn 
435 440 445 

Ala Thr Ala Ala Pro Ser Ser Asp Thr Val Leu Ala Arg Phe Glu Glu 
450 455 460 

Gin Ala Ala Arg Thr Pro Glu Ala Pro Ala Val Val Cys Gly Asp Val 
465 470 475 480 

Thr Val Thr Tyr Ala Glu Leu Glu Ala Gly Ala Asn Arg Leu Ala Arg 
485 490 495 

Val Leu Arg Ala Arg Gly Ala Gly Pro Glu Ser Val Val Ala Leu Cys 
500 505 510 

Leu Pro Arg Gly Pro Glu Val Val Thr Gly lie Leu Ala Ala Trp Lys 
515 520 525 



69 



Ala Gly Ala Ala Tyr Leu Pro Val Asp Thr Glu Leu Pro Ala Glu Arg 
530 535 540 

Val Ala Tyr Leu Leu Gly Asp Ser Ala Ala Ala Val Arg Leu Gly Thr 
545 550 555 560 

Ala Glu Thr Leu Ala Ala Leu Pro Asp Gly Pro Ala Ala Asp Val Asp 
565 570 575 

Val His Ala Pro Glu lie Ala Arg Glu Ser Pro Ser Pro Leu Arg Leu 
580 585 590 

Glu Pro Leu Pro Asp Gin Leu Ala Tyr Val lie Tyr Thr Ser Gly Ser 
595 600 605 

Thr Gly Leu Ser Lys Gly Val Gly Val Ser His Gly Gly Leu Ala Asn 
610 615 620 

Tyr Val Gly Trp Ala Ser Val Leu Tyr Gly Gly Leu Ser Ala Pro Leu 
625 630 635 640 

His Ser Ser Leu Ala Phe Asp Leu Thr Val Thr Ser Val Phe Val Pro 
645 650 655 

Leu Val Cys Gly Gly Ser Val Val Val Ser Ala Ala Gly Gly Gly Arg 
660 665 670 

Gly Leu Ala Ser Leu Leu Ala Ala Gly Asp Gly Phe Ser Leu Val Lys 
675 680 685 

Val Val Pro Gly His Leu Arg Leu Leu Ala Glu Leu Val Pro Ala Gly 
690 695 700 

Glu Met Ala Ala Val Gly Ser Leu Val Ala Gly Gly Glu Val Leu Ala 
705 710 715 720 

Gly Gly Asp Val Arg Glu Trp Leu Ser Arg Val Pro Gly Ser Val Val 
725 730 735 

Val Asn Glu Tyr Gly Pro Thr Glu Thr Val Val Gly Cys Ser Val Phe 
740 745 750 

Ser Val Ala Ala Gly Asp Val Val Gly Asp Val Val Pro Val Gly Arg 
755 760 765 

Pro Val Ala Asn Thr Arg Leu Phe Val Leu Asp Glu Gly Leu Arg Pro 
770 775 780 

Val Pro Ala Gly Val Ala Gly Glu Leu Tyr Val Ala Gly Ser Gin Val 
785 790 795 800 

Ala Arg Gly Tyr Val Gly Arg Ser Gly Leu Thr Ala Ser Arg Phe Val 
805 810 815 

Ala Cys Pro Phe Gly Val Gly Glu Arg Met Tyr Arg Thr Gly Asp Val 
820 825 830 

Val Arg Leu Ala Gly Gly Asp Leu Val Phe Val Gly Arg Val Asp Glu 
835 840 845 



70 



Gin Val Lys lie Arg Gly Tyr Arg Val Glu Pro Asp Glu Val Arg Leu 
850 855 860 

Val Val Ala Gly His Pro Arg Val Ala Gly Ala Ala Val Val Ala Arg 
865 870 875 880 

Pro Asp Ala Val Gly Glu Arg Gin Leu Val Ala Tyr Val Val Ala Ala 
885 890 895 

Gly Glu Pro Ala Gly Leu Ala Glu Ser Val Arg Ala His Val Ala Glu 
900 905 910 

Arg Leu Pro Glu Tyr Met Val Pro Ala Ala Val Val Thr Leu Asp Glu 
915 920 925 

lie Pro Leu Thr Val Asn Gly Lys Val Asp Arg Ala Ala Leu Pro Glu 
930 935 940 

Pro Gly Pro Val Ala Thr Gly Asn Ala Asp Arg Glu Pro Thr Thr Glu 
945 950 955 960 

Arg Glu Ser Leu Leu Cys Gly Ala Phe Ala Asp Val Leu Gly lie Glu 
965 970 975 

Arg Val Gly Val Asp Asp Asp Phe Phe Ser Leu Gly Gly His Ser Leu 
980 985 990 

Leu Ala Thr Ser Leu Val Ser Arg Val Arg Leu Val Leu Gly Glu Glu 
995 1000 1005 

Leu Pro lie Glu Glu Leu Phe Ala Thr Pro Thr Pro Ala Glu Leu 
1010 1015 1020 

Ala Ala Trp Leu Gin Arg Asn Ala Asp Arg Pro Gin Pro Ala Arg 
1025 1030 1035 

Pro Ala Leu Arg Pro Met His Glu Arg Glu Thr Thr Ala 
1040 1045 1050 

<210> 14 
<211> 6893 
<212> PRT 

<213>- Actinoplanes sp. 
<400> 14 

Met Thr Pro Met Ser Tyr Ala Gin Arg Arg Leu Trp Phe Gin Leu Arg 
15 10 15 

Val Glu Gly Pro Asp Ala Thr Tyr Asn Ser Pro Ala Val Leu Arg Leu 
20 25 30 

Thr Gly Glu Leu Asp Thr Ala Ala Leu Glu His Ala Leu Arg Asp Val 
35 40 45 

Leu Glu Arg His Glu Val Leu Arg Thr Val Tyr Pro Asp Val Gly Gly 
50 55 60 

Glu Pro Arg Gin Arg Val Val Arg Pro Asp Asp Met Val Trp Glu Leu 
65 70 75 80 
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Pro Thr Thr Arg Val Ser Gly Ala Gly Ala Gly Asp Asp Arg Leu Val 
85 90 95 

Thr Leu Asp Glu Leu Pro Trp Asp Arg Pro Val Leu Asp Leu Pro Ser 
100 105 110 

Pro Ala Pro Ala Gly Arg Glu Pro Asp Gly Glu lie Thr Val Asp Glu 
115 120 125 

Leu Pro Gly Ala lie Ala Arg Val Ala Ala His Pro Phe Asp Leu Ser 
130 135 140 

lie Glu lie Pro Val Arg Ala Arg Leu Phe Ala Leu Gly Pro Arg His 
145 150 155 160 

His Val Leu Val Val Val Leu His His lie Ala Thr Asp Gly Ser Ser 
165 170 175 

Gly Gly Pro Phe Ala Arg Asp Leu Ala Ala Ala Tyr Arg Ala Arg Arg 
180 185 190 

Thr Gly Thr Ala Pro Gin Trp Ala Pro Leu Pro Val Gin Tyr Ala Asp 
195 200 205 

Tyr Ala Ala Trp Gin Gin Glu Leu Leu Gly Ala Glu Asp Asp Pro Asp 
210 215 220 

Ser Val lie Ser Arg Gin Leu Ala His Trp Gin Glu Arg Leu Ala Gly 
225 230 235 240 

Met Pro Val Glu Leu Asp Leu Pro Ala Asp Arg Pro Arg Pro Ala Glu 
245 250 255 

Pro Gly His Gly Gly His Thr Lys Ala Leu Ser Leu Pro Pro Ala Val 
260 265 270 

His Arg Gly Leu Ala Thr Leu Ala Arg Arg Arg Arg Ala Thr Leu Gin 
275 280 285 

Met Val Val Gin Thr Gly Val Ala He Leu Leu Ser Lys Leu Gly Ala 
290 295 300 

Gly Arg Asp Val Pro Leu Gly He Pro Val Ala Gly Arg Thr Asp Ala 
305 310 315 320 

Ala Leu Asp Asp Leu He Gly Phe Phe Val Asn Thr Leu Val Val Arg 
325 330 335 

Ala Asp Leu Ser Gly Asp Pro Thr Val Ala Asp Ala Leu Gly Arg Val 
340 345 350 

Arg Gly Gly Ala Val Ala Ala Leu Ala Asp Gin Asp Val Pro Phe Asp 
355 360 365 

Lys Leu Val Glu Arg Leu Ala Pro Ala Arg Val Leu Gly Arg His Pro 
370 375 380 

Leu Phe Gin Val Met Val Ala Pro Leu Asp Asp Gly Thr Pro He Asp 
385 390 395 400 
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Leu Asp Gly Val Arg Gly Glu Pro Leu Thr lie Gly Arg Ser Gly Ala 
405 410 415 



Lys Phe Asp Val Glu Val Met Thr Gly Glu Val Arg Ala Ala Asp Gly 
420 425 430 

Ala Pro Ala Gly lie Arg Gly lie Leu Thr Leu Ser Ala Asp Leu Phe 
435 440 445 

Asp Glu Ala Thr Ala Gly Arg Met Ala Ala Gly Leu Val Arg Val Leu 
450 455 460 

Thr Ala Met Ala Glu Ala Pro Glu Arg Arg Leu Ser Gly lie Glu Val 
465 470 475 480 

Leu Ser Pro Gly Glu Arg Ser Arg Leu Leu Val Glu Trp Asn Asp Thr 
485 490 495 

Ala Arg Pro Val Val Glu Ser Ser Val Pro Ala Leu Phe Ala Lys Arg 
500 505 510 

Val Ala Ala Thr Pro Asp Ala Thr Ala Val Val Gly Glu Gly Val Ser 
515 520 525 

Trp Ser Tyr Arg Glu Leu Asp Arg Arg Ser Asp Val Leu Ala Arg Arg 
530 535 540 

Leu Val Ala Ala Gly Val Gly Val Glu Ser Pro Val Val Val Ala Leu 
545 550 555 560 

Glu Arg Ser Pro Glu Val Leu Ser Ala Phe Leu Ala Val Ala Lys Ala 
565 570 575 

Gly Gly Val Phe Val Pro Val Asp Leu Ser Trp Pro Gin Ala Arg Val 
580 585 590 

Asp Ala Val Val Ala Asp Cys Ala Ala Arg Val Ala Val Ala Asp Arg 
595 600 605 

Pro Met Ser Gly Leu Thr Val Val Ser Ala Gly Leu Gly Gly Asp Ser 
610 615 620 

Ala Val Val Ser Ala Asp Leu Thr Ala Asp Arg Ala Val Val Leu Pro 
625 630 635 640 

Ser Arg Pro Val Pro Gly Ala Ala Val Tyr Arg Met Tyr Thr Ser Gly 
645 650 655 

Ser Thr Gly Arg Pro Lys Gly Val Val Thr Thr His Gin Asn Leu Val 
660 665 670 

Asp Leu Ala Thr Asp Thr Cys Trp Gly Pro Thr Pro Arg Val Leu Phe 
675 680 685 

His Ala Pro His Ala Phe Asp Ala Ser Ser Tyr Glu lie Trp Val Pro 
690 695 700 

Leu Leu Asn Gly Gly Thr Val Val Val Ala Pro Gin Arg Ser lie Asp 
705 710 715 720 
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Ala Thr Val Leu Lys Asp Leu lie Arg Ala His Asp Leu Thr His Val 
725 730 735 



His Val Thr Ala Gly Leu Leu Arg Val Leu Asp Pro Ser Cys Phe Ala 
740 745 750 

Gly Leu Thr Glu Val Leu Thr Gly Gly Asp Ala Val Ser Ala Glu Ala 
755 760 765 

Val Arg Arg Val Lys Asp Ala Asn Pro Gly Leu Arg Val Arg Gin Leu 
770 775 780 

Tyr Gly Pro Thr Glu Val Thr Leu Cys Ala Thr Gin His Leu Leu Asp 
785 790 795 800 

Asp Gly Val Pro lie Gly Arg Pro Leu Asp Asn Thr Arg Val Tyr Val 
805 810 815 

Leu Asp Asp Leu Leu Gin Pro Val Pro Val Gly Val Thr Gly Glu Leu 
820 825 830 

Tyr Val Ala Gly Ala Gly Val Ala Arg Gly Tyr Ala Gly Met Pro Gly 
835 840 845 

Leu Thr Ala Glu Arg Phe Val Ala Asp Pro Phe Asn Thr Gly Gly Arg 
850 855 860 

Leu Tyr Arg Thr Gly Asp Leu Val Arg Trp Thr Asp Asp Gly Val Leu 
865 870 875 880 

His Phe Ala Gly Arg Ala Asp Asp Gin Val Lys lie Arg Gly Tyr Arg 
885 890 895 

Val Glu Pro Gly Glu Val Glu Ala Val Leu Ala Gin His Pro Asp Val 
900 905 910 

Ser Gin Val Ala Val Val Val Arg Glu Asp Thr Pro Gly Asp Lys Arg 
915 920 925 

Leu Val Ala Tyr Val Val Gly Gly Asp He Glu Ala Tyr Gly Gin Glu 
930 935 940 

Arg Leu Pro Gly Tyr Met Val Pro Ser Ala Phe Val His Leu Asp Ala 
945 950 955 960 

Leu Pro Leu Thr Ser Asn Gin Lys Val Asp Arg Ala Ala Leu Pro Ala 
965 970 975 

Pro Ser Met Glu Ser Gly Ala Gly Arg Ala Pro Ala Asp Ala Arg Glu 
980 985 990 

Glu Leu Val Cys Ala Ala Phe Ala Glu Val Leu Gly Leu Asp Arg Val 
995 1000 1005 

Gly Val Asp Asp Asp Phe Phe Ala Leu Gly Gly His Ser Leu Leu 
1010 1015 1020 

Ala Val Ser Leu Val Glu Asp Leu Arg Gin Arg Gly Leu His Val 
1025 1030 1035 
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Ser Val Arg Ala Leu Phe Ala Thr Pro Thr Pro Ala Ala Leu Ala 
1040 1045 1050 

Val Ser Thr Val Ala Ala Pro lie Glu Val Pro Pro Asn Leu lie 
1055 1060 1065 

Pro Gin Gly Gly Ala Arg Glu Leu Thr Pro Asp Met Leu Pro Leu 
1070 1075 1080 

Val Asp Leu Thr Gly Glu Glu Leu Ala Thr lie Val Ala Ala Val 
1085 1090 1095 

Pro Gly Gly Ala Ala Asn lie Ala Asp lie Tyr Pro Leu Ala Pro 
1100 1105 1110 

Leu Gin Glu Gly lie Phe Phe His His Leu Met Thr Glu Gly Asp 
1115 1120 1125 

Thr Ala Asp Val Tyr Ala Leu Pro Tyr Leu Leu Arg Val Gly Thr 
1130 1135 1140 

Arg Glu Gin Leu Asp Ala Phe Leu Gly Ala Leu Gin Gin Val Val 
1145 1150 1155 

Asp Arg His Asp Val Tyr Arg Thr Ala lie Ala Trp Gin Asn Leu 
1160 1165 1170 

Arg Glu Pro Val Gin Val Val His Arg His Ala Thr Leu Pro Val 
1175 1180 1185 

Thr Glu Val Thr Pro Asp Gin Leu His Ala Ala Ala Thr Gly Gly 
1190 1195 1200 

Arg Leu Pro Leu Asp His Ala Pro Leu Leu Ser Val His lie Ala 
1205 1210 1215 

Pro Glu Pro Asp Gly Gly Trp Leu Ala Leu Leu Arg Met His His 
1220 1225 1230 

Leu Val Gin Asp His Thr Ala Leu Asp lie Val Leu Asp Glu lie 
1235 1240 1245 

Arg Thr lie Leu Ala Gly Ala Thr Asp His Leu Pro Pro Pro Val 
1250 1255 1260 

Pro Phe Arg Asn Phe Val Ala Arg Ser Arg Arg Gly Ala Ala Glu 
1265 1270 1275 

Ala Ala His Arg Asp Tyr Phe Thr Gly Leu Leu Gly Asp Val Thr 
1280 1285 1290 

Glu Thr Thr Ala Pro Tyr Gly Leu Thr Asp Val His Gly Glu His 
1295 1300 1305 

Ser Gly Val Arg Arg Gly Arg Leu Ala Val Ser Ala Gly Leu Ala 
1310 1315 1320 

Gly Arg Val Arg Glu Thr Ala Arg Asp Arg Gly Val Ser Pro Ala 
1325 1330 1335 
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Thr Leu Phe His Leu Ala Trp Ala Arg Val Leu Ala Ala Val Ser 
1340 1345 1350 



Gly Arg Asp Asp Val Val Phe Gly Thr Val Leu Leu Gly Arg Met 
1355 1360 1365 

Asp Ala Gly Pro Gly Ala Asp Arg Val Pro Gly Leu Phe Met Asn 
1370 1375 1380 

Thr Leu Pro Val Arg Val Arg Leu Gly Gly Arg Thr Val Asp Glu 
1385 1390 1395 

Ala Leu His Gly Met Arg Ala Gin Leu Ala Asp Leu Leu Thr His 
1400 1405 1410 

Glu His Ala Pro Leu Val Leu Ala Gin Gin Ser Ala Gly Leu Pro 
1415 1420 1425 

Gly Gly Ser Pro Leu Phe Thr Ser Leu Phe Asn Tyr Arg His Asn 
1430 1435 1440 

Ala Thr Asp He Glu Arg Ser Gly Thr Gly He Asp Gly Val Glu 
1445 1450 1455 

Ala Leu Pro Thr Gly Asp Pro Ser Asn Tyr Pro Leu Asp Val Ser 
1460 1465 1470 

Val Asn Gin Ser Pro Leu Gly Phe Glu Leu Val Val Glu Ala Thr 
1475 1480 1485 

Glu Pro Ala Asp Pro Asp Gin Leu Cys Arg Leu Leu His Ala Cys 
1490 1495 1500 

Leu Asp Asp Leu He Ala Ala Leu Asp Glu Gin Pro Gly Arg Ala 
1505 1510 1515 

Leu Gly Thr Leu Asp Val Val Ala Gly Arg Glu Arg Asp Leu Leu 
1520 1525 1530 

Leu Asp Gly Trp Asn Ala Thr Ala Val Pro Ala Gin Pro Ala Leu 
1535 1540 1545 

Val Pro Glu Leu Phe Thr Ala Gin Ala Ala Arg Thr Pro Thr Trp 
1550 1555 1560 

Pro Ala Leu Val Thr Ala Gly Ala Glu Met Ser Tyr Ala Glu Leu 
1565 1570 1575 

Glu Glu Arg Ser Asn Arg Leu Ala Arg Trp Leu Ala Gly Arg Gly 
1580 1585 1590 

Val Gly Ala Asp Asp Arg Val Ala Leu Met Met Arg Arg Gly Pro 
1595 1600 1605 

Glu Leu Met Val Ala He Leu Ala Val Leu Lys Ala Gly Ala Ala 
1610 1615 1620 

Tyr Leu Pro Val Asp Pro Asp Leu Pro Arg Asp Arg Val Asp Tyr 
1625 1630 1635 
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Leu Leu Ala Asp Ala Ala Pro Ala Phe Val Leu Ala Glu Arg Ala 
1640 1645 1650 



Thr Ala Pro Trp Val Pro Val Ala Gly Gly lie Pro Val Leu Val 
1655 1660 1665 

Val Asp Ala Pro Ala Val Ala Ala Glu Val Ala Ala His Ser Gly 
1670 1675 1680 

Glu Ala Val Thr Asp Arg Asp Arg Arg Ala Ala Leu Arg Gly Gly 
1685 1690 1695 

His Leu Ala Tyr Val lie Tyr Thr Ser Gly Ser Thr Gly Arg Pro 
1700 1705 1710 

Lys Gly Val Leu lie Thr His Asp Gly Leu Ala Asn Leu Thr Leu 
1715 1720 1725 

Asp His Gly Arg Phe Gly Leu Gly Pro Gly Ala Arg Val Ala Gin 
1730 1735 1740 

Phe Ala Ser Pro Gly Phe Asp Met Phe Val Asp Glu Trp Ser Met 
1745 1750 1755 

Ala Leu Leu Ala Gly Ala Ala Leu Thr Phe Val Pro Pro Glu Arg 
1760 1765 1770 

Arg Leu Gly Ala Asp Leu Ala Ala Phe Leu Ala Glu Tyr Gly Val 
1775 1780 1785 

Thr His Ala Thr Leu Pro Pro Ala Val Val Gly Thr lie Pro Asp 
1790 1795 1800 

Gly Val Leu Pro Pro Ser Phe Val Leu Asp Val Gly Gly Asp Val 
1805 1810 1815 

Leu Pro Gly Asp Leu Ala Arg Arg Trp Leu Arg Asp Gly Arg Val 
1820 1825 1830 

Leu Phe Asn Ser Tyr Gly Pro Thr Glu Thr Thr Val Asn Ala Ala 
1835 1840 1845 

Thr Trp Arg Ala Glu Ala Gly Asp Trp Gly Ser Val Ala Pro lie 
1850 1855 1860 

Gly Thr Pro Val Pro Asn Leu Arg Ala Tyr Val Leu Asp Gly Trp 
1865 1870 1875 

Leu Arg Pro Val Pro Val Gly Ala Asp Gly Glu Leu Tyr Val Ser 
1880 1885 1890 

Gly Ala Gly Leu Ala Arg Gly Tyr Leu Asn Arg Ala Gly Leu Thr 
1895 1900 1905 

Ala Glu Arg Phe Val Ala Cys Pro Phe Glu Pro Gly Glu Arg Met 
1910 1915 1920 

Tyr Arg Thr Gly Asp Val Val Arg Trp Thr Ala Glu Gly Arg Leu 
1925 1930 1935 
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Val Phe Ala Gly Arg Ser Asp Asp Gin Val Lys He Arg Gly Phe 
1940 1945 1950 



Arg He Glu Pro Gly Glu Val Glu Ala Val Leu Ala Ala Gly Pro 
1955 1960 1965 

Gly Val Ser Gin Ala Ala Val He Val Arg Glu Asp Val Pro Gly 
1970 1975 1980 

Asp Lys Arg Leu Val Ala Tyr Val Val Gly Gly Asp Val Glu Ala 
1985 1990 1995 

Leu Arg Ser Tyr Ala Gin Gin Arg Leu Pro Gly Tyr Met Val Pro 
2000 2005 2010 

Ser Ala Phe Val Glu Leu Asp Arg Leu Pro Leu Thr Val Asn Gly 
2015 2020 2025 

Lys Leu Asp Arg Arg Ala Leu Pro Val Pro Asp Leu Ala Arg Gly 
2030 2035 2040 

Thr Gly Ser Gly Arg Pro Ala Gly Thr Pro Arg Glu Gin Leu Leu 
2045 2050 2055 

Cys Ala Gly Phe Ala Ala Val Leu Gly Val Asp Asp Val Gly Ala 
2060 2065 2070 

Asp Asp Asp Phe Phe Ala Leu Gly Gly His Ser Leu Leu Val Val 
2075 2080 2085 

Ser Leu Val Glu Trp Leu Arg Arg Arg Gly Val Ser Val Pro Val 
2090 2095 2100 

Arg Ala Leu Phe Thr Thr Pro Thr Pro Ala Gly Leu Ala Glu Ala 
2105 2110 2115 

Val Gly Asp Gly Ala Val Val Val Pro Pro Asn Leu He Pro Glu 
2120 2125 2130 

Gly Ala Ala Glu Leu Thr Pro Glu Met Val Pro Leu Ala Asp Leu 
2135 2140 2145 

Thr Ser Glu Glu Leu Ala He Val Val Ala Ser Val Pro Gly Gly 
2150 2155 2160 

Ala Ala Asn Val Ala Asp Val Tyr Pro Leu Ala Pro Leu Gin Glu 
2165 2170 2175 

Gly He Phe Phe Pro Val Ala Thr Gly Pro Gin Cys Tyr Ala Thr 
2180 2185 2190 

Val Gly Ser Ser Leu Pro Asp Asp Gly Gly Ser Ala Pro Cys Ser 
2195 2200 2205 

Arg Phe Arg Arg Arg Cys Val . Ser Thr Ser Val Val Trp Gin Gly 
2210 2215 2220 

Leu Arg Glu Pro Val Gin Val Val Trp Arg His Ala Arg Leu Pro 
2225 2230 2235 
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Val Glu Glu Val Val Leu His Glu Gly Ala Asp Pro Val Glu Gin 
2240 2245 2250 

Met Met Ala Leu Ala Gly Gly Trp Met Asp Leu Thr Arg Ala Pro 
2255 2260 2265 

Leu lie Asp Val His lie Ala Ala Gly Pro Gly Gly Asp Arg Trp 
2270 2275 2280 

Leu Ala Val Leu Arg lie His His Leu Val Gin Asp His Thr Ala 
2285 2290 2295 

Leu Glu Thr Leu Leu Asp Glu Leu Gin Ser Phe Leu Glu Gly Arg 
2300 2305 2310 

Gly Gly Glu Leu Ala Glu Pro Val Pro Phe Arg Glu Phe Val Ala 
2315 2320 2325 

Gin Ala Arg Leu Gly Val Pro Arg Glu Glu His Glu Arg Tyr Phe 
2330 . 2335 2340 

Ala Glu Leu Leu Gly Asp lie Thr Glu Thr Thr Ala Pro Tyr Asp 
2345 2350 2355 

Leu Thr Asp Val His Gly Asp Gly Thr Gly Tyr Asp His Gly Ala 
2360 2365 2370 

Leu Pro Leu Asp Ala Thr Val Ala Ala Arg Val Arg Glu Ala Ala 
2375 2380 2385 

Arg Thr Leu Gly Val Ser Pro Ala Thr Leu Phe His Leu Ala Trp 
2390 2395 2400 

Ala Arg Val Leu Gly Thr Leu Ala Gly Arg Asp Asp Val Val Phe 
2405 2410 2415 

Gly Thr Val Leu Phe Gly Arg Met Asn Ser Gly Ala Gly Ala Asp 
2420 2425 2430 

Arg Val Ser Gly Leu Phe lie Asn Thr Leu Pro Val Arg Val Arg 
2435 2440 2445 

Leu Gly Ala Pro Thr Gly Asp Ala Leu Gly Asp Leu Arg Asp Gin 
2450 2455 2460 

Leu Ala Glu Leu Leu Val His Glu His Ala Ser Leu Ala Ser Ala 
2465 2470 2475 

Gin Lys Ala Ser Gly Leu Pro Gly Gly Ser Pro Leu Phe Thr Ser 
2480 2485 2490 

lie Phe Asn Tyr Arg His Asn Gin Val Ser Ala Glu Arg Glu Thr 
2495 2500 2505 

Ala Ala Leu Pro Gly lie Arg Val Leu Ala Ala Arg Asp Ser Thr 
2510 2515 2520 

Asn Tyr Pro Leu Thr Val Ala Val Asp Asp Asp Gly His Gly Phe 
2525 2530 2535 
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Thr Leu Val Val Glu Val Ala Ser Thr Val Asp Ala Ala Gly Val 
2540 2545 2550 



Cys Glu Leu Leu His Thr Ala Val Asp Asn Leu lie Ala Ala Leu 
2555 2560 2565 

Thr Asp Arg Pro Gly Gly Pro Leu Ala Glu Val Asp lie Leu Glu 
2570 2575 2580 

Arg Gly Leu Arg Asp Arg Leu Leu Thr Ala Trp Asn Glu Ala Arg 
2585 2590 2595 

Glu Pro Ala Pro Pro Val Thr Leu Pro Asp Leu Phe Asp Arg Gin 
2600 2605 2610 

Ala Arg Arg Thr Pro Glu Ala Val Ala Leu Thr Ala Asp Gly Val 
2615 2620 2625 

Ser Leu Thr Tyr Arg Glu Leu Ser Glu Arg Ala Asn Arg lie Ala 
2630 2635 2640 

Arg Leu Leu Thr Ser Arg Gly lie Gly Pro Glu Ser Leu Val Gly 
2645 2650 2655 

Val Val Leu Pro Arg Ser Ala Asp Leu Val Val Ala Leu Leu Gly 
2660 2665 2670 

Val Leu Gin Ala Gly Ala Ala Tyr Val Pro Val Asp Ala Asp Tyr 
2675 2680 2685 

Pro Ala Glu Arg lie Gly Tyr lie Leu Gly Asp Ala Gly Ala Val 
2690 2695 2700 

Cys Val Leu Thr Val Asp Ala Thr Ala Gly Ala Val Pro Pro Gly 
2705 2710 2715 

Val Pro Lys Leu Val Leu Asp His Pro Glu Thr Val Thr Ala Leu 
2720 2725 2730 

Ala Ala Cys Asp Thr Ala Pro Leu Gly Glu Ala Glu Arg Ala Gly 
2735 2740 2745 

Glu Leu Leu Pro Glu His Pro Ala Tyr Val lie Tyr Thr Ser Gly 
2750 2755 2760 

Ser Thr Gly Thr Pro Lys Gly Val Leu lie Pro His Arg Asn Val 
2765 2770 2775 

Val Glu Leu Phe Ala Ala Thr Arg Gly Ser Phe His Phe Gly Glu 
2780 2785 2790 

Gly Asp Val Trp Ser Trp Phe His Ser Val- Ala Phe Asp Phe Ser 
2795 2800 2805 

Val Trp Glu Leu Trp Gly Ala Leu Leu His Gly Gly Arg Val Val 
2810 2815 2820 

Met Val Pro Phe Ala Val Ser Arg Ser Pro Arg Asp Phe Trp Glu 
2825 2830 2835 
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Leu Leu Val Arg Glu Arg Val Thr Val Leu Ser Gin Thr Pro Ser 
2840 2845 2850 



Ala Phe Tyr Gin Leu Ala Ala Ala Ala Asp Asp Thr Pro Asp Ala 
2855 2860 2865 

Leu Arg Val Val Val Phe Gly Gly Glu Ala Leu Asp Pro Gly Arg 
2870 2875 2880 

Leu Ala Gly Trp Arg Glu Arg Arg Pro Asp Gly Pro Arg Leu Val 
2885 2890 2895 

Asn Met Tyr Gly lie Thr Glu Thr Thr Val His Val Thr His Gin 
2900 2905 2910 

Asp Leu Ala Pro Ala Asp Thr Thr Gly Ser Pro He Gly Arg Gly 
2915 2920 2925 

He Pro Gly Leu Ser Val Tyr Val Leu Asp Glu Ala Leu Arg Pro 
2930 2935 2940 

Val Pro Pro Gly Val Ala Gly Glu Val Tyr Val Ala Gly Arg Gin 
2945 2950 2955 

Leu Ala Arg Ala Tyr Leu Gly Arg Ala Ala Leu Thr Gly Thr Arg 
2960 2965 2970 

Phe Val Ala Cys Pro Phe Leu Pro Ala Gly Glu Arg Met Tyr Arg 
2975 2980 2985 

Thr Gly Asp Arg Ala Arg Trp Ser Arg Gly Arg Leu Gin Phe Ala 
2990 2995 3000 

Gly Arg Thr Asp Asp Gin Val Gin He Arg Gly Phe Arg He Glu 
3005 * 3010 3015 

Pro Gly Glu Val Gin Ala Val Val Ala Ala His Pro Glu He Ala 
3020 3025 3030 

Ala Ala Ala Val Val Val Arg Glu Asp Val Pro Gly Asp Pro Arg 
3035 3040 3045 

Leu Thr Ala Tyr Val Val Pro Ala Gly Pro Arg Thr Ala Pro Ala 
3050 3055 3060 

Ala Val Ala Glu Thr Val Arg Arg Phe Ala. Ala Asp Arg Leu Pro 
3065 3070 3075 

Ala Tyr Met Leu Pro Ser Ala Val Val Val Leu Asp Ala Leu Pro 
3080 3085 3090 

Leu Thr Asp His Gly Lys Leu Asp Arg Arg Ala Leu Pro Ala Pro 
3095 3100 3105 

Gin His Thr Gly Ala Ala Ser Gly Arg Ala Pro Ala Thr Val Ala 
3110 3115 3120 

Glu Glu Val Leu Cys Ala Ala Phe Ala Glu Val Leu Gly Val Glu 
3125 3130 3135 
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Arg Val Gly Val Asp Asp Asp Phe Phe Ala Leu Gly Gly His Ser 
3140 3145 3150 



Leu Leu lie Val Ser Leu Val Glu Arg Val Arg Arg Ala Gly Leu 
3155 3160 3165 

Ala lie Pro Val Arg Ala Leu Phe Arg Ser Ala Thr Pro Ala Gly 
3170 3175 3180 

Leu Ala Ala Leu Ala Arg Pro Tyr Arg Val Asp lie Pro Pro Asn 
3185 3190 3195 

Leu Val Pro Asp Gly Ala Arg Glu lie Thr Pro Asp Met Leu Thr 
3200 3205 3210 

Leu Ala Ala Leu Thr Glu Ala Glu lie Ala Thr Val Leu Ala Thr 
3215 3220 3225 

Val Pro Gly Gly Ala Val Asn Val Ala Asp lie Tyr Pro Leu Ala 
3230 3235 3240 

Pro Leu Gin Glu Gly lie Phe Phe His His Leu Met Ala Asp Ala 
3245 3250 3255 

Gly Arg Ala Asp Ala Tyr Ala Met Pro Tyr Val Leu His Leu Asp 
3260 3265 3270 

Thr Ala Glu Arg Leu Asp Val Leu Leu Gly Ala Leu Gin Arg Val 
3275 3280 3285 

lie Asp Arg Asn Asp lie Tyr Arg Thr Gly Val Val Ser Ala Gly 
3290 3295 3300 

Leu Arg Glu Pro Val Gin Val Val Trp Arg Ser Ala Val Leu Pro 
3305 3310 3315 

Val Glu Glu Val Ala Leu Asp Gly Gly His Asp Pro Val Glu Gin 
3320 3325 3330 

Leu Leu Ala Ala Ala Gly Glu Glu Phe Asp Leu Thr Arg Ala Pro 
3335 3340 3345 

Leu lie Arg Ala His Val Ala Ala His Pro Asp Gly Gly Arg Leu 
3350 3355 3360 

Leu Leu Leu Arg lie His His Leu Val Gin Asp His Thr Thr Phe 
3365 3370 3375 

Asp Val Val Leu Gly Glu Leu Arg Ala Phe Leu Glu Gly Arg Gly 
3380 3385 3390 

Gly Glu Leu Ala Glu Pro Val Pro Phe Arg Glu Phe Val Ala Gin 
3395 3400 3405 

Ala Arg Leu Gly Val Pro Arg Glu Glu His Glu Arg Tyr Phe Ala 
3410 3415 3420 

Glu Leu Leu Gly Asp Val Thr Glu Thr Thr Ala Pro Tyr Gly Leu 
3425 3430 3435 
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Thr Asp Val His Gly Asp Gly Ser Arg Ala Val Gin Val Ser Leu 
3440 3445 3450 



Pro Val Ala Glu Ala Leu Ala Val Arg Val Arg Glu Val Ala Arg 
3455 3460 3465 

Thr Leu Gly Val Ser Pro Ala Thr Val Phe His Leu Ala Trp Ala 
3470 3475 3480 

Arg Val Leu Ser Val lie Ala Gly Arg Asp Asp Val Val Phe Gly 
3485 3490 3495 

Thr He Leu Phe Gly Arg Met Asn Ser Gly Ala Ala Ala Glu Arg 
3500 3505 3510 

Val Pro Gly Leu Phe He Asn Thr Leu Pro Val Arg Val Arg Leu 
3515 3520 3525 

Asn Gly Thr Ser Val Gly Glu Ala Leu Thr Ala Leu Arg Asp Gin 
3530 3535 3540 

Met Ala Glu Leu Met Ala His Glu His Ala Pro Leu Ala Leu Ala 
3545 3550 3555 

Gin Arg Ala Gly Gly Val Pro Ala Gly Ser Pro Leu Phe Thr Ser 
3560 3565 3570 

Leu Phe Asn Tyr Arg His Asn Val Ala Gly Gly Gly Asp Gly Gly 
3575 3580 3585 

Ala Leu Glu Gly Val Thr Pro Val Leu His Arg Asp Thr Thr Asn 
3590 3595 3600 

"Tyr Pro Val Val Val Ser Val Asp Asp Asp Gly Thr Ser Phe Asp 
3605 3610 3615 

Leu Val Val Glu Ala Val Ala Pro Ala Glu Ala Gly Arg Val Gly 
3620 3625 3630 

Arg Leu Met His Glu Cys Leu Ala Glu Leu Val Gly Ala Leu Ala 
3635 3640 3645 

Gly Ala Pro Glu Thr Pro Leu Ser Arg Val Arg Val He Asp Glu 
3650 3655 3660 

Ala Glu He Glu Arg Val Val His Ser Trp Asn Asp Thr Ala Arg 
3665 3670 3675 

Pro Val Val Glu Ser Ser Val Pro Ala Leu Phe Ala- Glu Gin Val 
3680 3685 3690 

Ala Ala Ala Pro Asp Ala Thr Ala Val Val Gly Glu Gly Val Ser 
3695 3700 3705 

Trp Ser Tyr Arg Glu Leu Asp Ala Arg Ser Asp Ala Leu Ala Arg 
1710 3715 3720 

Ser Leu Val Ala Ala Gly Val Gly Val Glu Ser Pro Val Val Val 
3725 3730 3735 
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Ala Leu Glu Arg Ser Pro Glu Val Leu Ser Ala Phe Leu Ala Val 

3740 3745 3750 

Ala Lys Ala Gly Gly Val Phe Val Pro Val Asp Leu Ser Trp Pro 

3755 3760 3765 

Gin Ala Arg lie Asp Ala Val Val Ala Asp Cys Ala Ala Arg Val 

3770 3775 3780 

Ala Val Ala Asp Arg Pro Met Ser Gly Leu Thr Val Val Pro Ala 

3785 3790 . 3795 

Asp Gin Val Gly Asp Ser Ala Val Val Leu Pro Ala Gly Pro Val 

3800 3805 3810 

Pro Gly Ala Ala Val Tyr Arg Met Tyr Thr Ser Gly Ser Thr Gly 

3815 3820 3825 

Arg Pro Lys Gly Val Val Thr Thr His Gin Asn Leu Val Asp Leu 

3830 3835 3840 

Ala Thr Asp Thr Cys Trp Gly Pro Thr Pro Arg Val Leu Phe His 

3845 3850 3855 

Ala Pro His Ala Phe Asp Ala Ser Ser Tyr Glu lie Trp Val Pro 

3860 3865 3870 

Leu Leu Asn Gly Gly Thr Val Val Val Ala Pro Gin Arg Ser lie 

3875 3880 3885 

Asp Ala Thr Val Leu Arg Asp Leu lie Arg Gly His Glu Leu Thr 

3890 3895 3900 

His Val His Val Thr Ala Gly Leu Leu Arg Val Leu Asp Pro Ser 

3905 3910 3915 

Cys Phe Ala Gly Leu Thr Glu Val Leu Thr Gly Gly Asp Ala Val 

3920 3925 3930 

Ser Ala Glu Ala Val Arg Arg Val Arg Glu Ala Asn Pro Gly Leu 

3935 3940 3945 

Arg Val Arg Gin Leu Tyr Gly Pro Thr Glu Val Thr Leu Cys Ala 

3950 3955 3960 

Thr Gin His Leu Leu Val Asp Gly Val Pro lie Gly Arg Pro Leu 

3965 3970 3975 

Asp Asn Thr Arg Val Tyr Val Leu Asp Asp Leu Leu Gin Pro Val 

3980 3985 3990 

Pro Val Gly Val Thr Gly Glu Leu Tyr Val Ala Gly Ala Gly Leu 

3995 4000 4005 

Ala Arg Gly Tyr Ala Gly Met Pro Gly Leu Thr Ala Glu Arg Phe 

4010 4015 4020 

Val Ala Asp Pro Phe Ser Val Gly Gly Arg Leu Tyr Arg Thr Gly 

4025 4030 4035 
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Asp Leu Val Arg Trp Thr Asp Asp Gly Val Leu His Phe Ala Gly 
4040 4045 4050 



Arg Ala Asp Asp Gin Val Lys lie Arg Gly Tyr Arg Val Glu Pro 
4055 4060 4065 

Gly Glu Val Glu Ala Val Leu Ala Gin His Pro Asp Val Ser Gin 
4070 4075 4080 

Val Ala Val Val Val Arg Glu Asp Thr Pro Gly Asp Lys Arg Leu 
4085 4090 4095 

Val Ala Tyr Val Val Gly Gly Asp Val Glu Ala Tyr Ala Gin Glu 
4100 4105 4110 

Arg Leu Pro Gly Tyr Leu Val Pro Ser Ala Phe Val His Leu Asp 
4115 4120 4125 

Ala Leu Pro Leu Thr Ser Asn Gin Lys Val Asp Arg Ala Ala Leu 
4130 4135 4140 

Pro Ala Pro Ser Val Glu Ser Gly Val Gly Arg Ala Pro Ala Asp 
4145 4150 4155 

Ala Arg Glu Glu Leu Met Cys Ala Ala Phe Ala Glu Val Leu Asp 
4160 4165 4170 

Leu Asp Arg Val Gly Val Asp Asp Asp Phe Phe Ala Leu Gly Gly 
4175 4180 4185 

His Ser Leu Leu Val Val Arg Leu Val Gly Arg lie Arg Gin Val 
4190 4195 4200 

Phe Gly Val Glu Val Ser Ala Arg Leu Val Phe Asp Ala Arg Thr 
4205 4210 4215 

Pro Ala Gly Val Val Ala Arg Leu Ser Glu Gly Gly Thr Ala Arg 
4220 4225 4230 

Glu Ala Val Arg Ala Arg Val Arg Pro Ala Arg Val Pro Leu Ser 
4235 4240 4245 

Phe Ala Gin Arg Arg Leu Trp Phe Leu Ser Gin Leu Glu Gly Pro 
4250 4255 4260 

Ser Ala Thr Tyr Asn lie Pro Val Ala Leu Arg Leu Asp Gly Pro 
4265 4270 4275 

Leu Asp Arg Asp Ala Leu Thr Ala Ala Leu His Asp Val Val Ala 
4280 4285 4290 

Arg His Glu Val Leu Arg Thr Val Phe Thr Val Ala Asp Gly Glu 
4295 4300 4305 

Pro Trp Gin Gin lie Leu Asp Asp Pro Gin Val Ser Val Pro Val 
4310 4315 4320 

Val Glu Val Thr Pro Asp Arg Leu Pro Glu Ala Val Ala Val Ala 
4325 4330 4335 
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Ala Gly His Arg Phe Asp Leu Gly Arg Glu Leu Pro Leu Arg Ala 
4340 4345 4350 



Val Leu Leu Ala Thr Gly Asp Asp Val His Val Leu Val Leu Val 
4355 4360 4365 

Val His His lie Ala Ala Asp Gly Trp Ser Met Arg Pro Leu Ala 
4370 4375 4380 

Arg Asp Leu Ala Ala Ala Tyr Ala Ala Arg lie Asp Ala Thr Ala 
4385 4390 4395 

Pro Ala Leu Gly Ala Leu Pro Val Gin Tyr Ala Asp Tyr Ala Leu 
4400 4405 4410 

Trp Gin Arg Asp Val Leu Gly Ser Glu His Asp Pro Asp Ser Val 
4415 4420 4425 

lie Ser Gin Gin Val Ala Tyr Trp Arg Arg Gin Leu Ala Gly Val 
4430 4435 4440 

Pro Glu Glu Leu Asp Leu Pro Val Asp Arg Ala Arg Pro Ala Glu 
4445 4450 4455 

Ala Ser His Arg Gly His Thr Val Glu Phe Ala Val Pro Pro Ala 
4460 4465 4470 

Val His His Gin Leu Ala Glu Leu Ala Arg Arg Asn Gly Val Thr 
4475 4480 4485 

Val Phe Met Thr Val Gin Thr Ala Leu Ala Val Leu Leu Ser Lys 
4490 4495 4500 

Leu Gly Ala Gly Thr Asp He Pro He Gly Val Ala Val Ala Gly 
4505 4510 4515 

Arg Thr Asp Pro Thr Leu Asp Asn Leu He Gly Phe Phe Val Asn 
4520 4525 4530 

Thr Leu Val Leu Arg Thr Asp Leu Thr Gly Asn Pro Thr lie Thr 
4535 4540 4545 

Asp Leu Leu His Arg Thr Arg Asp Thr Thr Leu His Ala Phe Thr 
4550 4555 4560 

His Gin Asp Val Pro Phe Glu Lys Leu Val Glu Asp Leu Ala Pro 
4565 4570 4575 

Thr Arg Ser Leu Ala Arg His Pro Leu Phe Gin Val Met Met Thr 
4580 4585 4590 

Leu Gin Ser Ala Ser Ala Asp Glu Glu Pro Leu Ala Leu Ala Gly 
4595 4600 4605 

Leu Arg Val Thr Asp Leu Pro Ala Gly Glu Thr Pro Ala Lys Val 
4610 4615 4620 

Asp Leu Asp Leu Thr Leu His Glu Val Ala Gly Arg Asp Gly Met 
4625 4630 4635 
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His Ala Thr Leu Leu Gly Ala Ala Asp Leu Phe Glu Gin Glu Thr 
4640 4645 4650 



Val Arg Ala Leu Ala Asp Arg Leu Leu Arg Thr Leu Glu Ala Met 
4655 4660 4665 

Ala Ala Ala Pro Asp Asp Arg Leu Asp Arg lie Glu Val Leu Ser 
4670 4675 4680 

Pro Gly Glu Arg Ser Arg Leu Leu Val Glu Trp Asn Asp Thr Ala 
4685 4690 4695 

Arg Pro Val Val Glu Ser Ser Val Pro Ala Leu Phe Ala Glu Gin 
4700 4705 4710 

Val Ala Ala Ala Pro Asp Ala Val Ala Val Val Gly Glu Gly Val 
4715 4720 4725 

Ser Trp Thr Tyr Arg Glu Leu Asp Ala Arg Ser Asp Ala Leu Ala 
4730 4735 4740 

Arg Ser Leu Val Ala Ala Gly Val Gly Val Glu Ser Pro Val Val 
4745 4750 4755 

Val Ala Leu Glu Arg Ser Pro Glu Val Leu Ser Ala Phe Leu Ala 
4760 4765 4770 

Val Ala Lys Ala Gly Gly Val Phe Val Pro Val Asp Leu Ser Trp 
4775 4780 4785 

Pro Gin Ala Arg Val Asp Ala Val Val Ala Asp Cys Gly Ala Arg 
4790 4795 4800 

He Ala Val Ala Asp- Arg Pro Met Ser Gly Leu Thr Val Val Ser 
4805 4810 4815 

Ala Gly Leu Gly Gly Asp Ser Ala Val Val Ser Gly Asp Leu Thr 
4820 4825 4830 

Ala Asp Arg Ala Val Val Leu Pro Ala Gly Pro Val Pro Gly Ala 
4835 4840 4845 

Ala Val Tyr Arg Met Tyr Thr Ser Gly Ser Thr Gly Arg Pro Lys 
4850 4855 4860 

Gly Val Val Thr Thr His Gin Asn Leu Val Asp Leu Ala Thr Asp 
4865 4870 4875 

Thr Cys Trp Gly Pro Thr Pro Arg Val Leu Phe His Ala Pro His 
4880 4885 4890 

Ala Phe Asp Ala Ser Ser Tyr Glu He Trp Val Pro Leu Leu Asn 
4895 4900 4905 

Gly Gly Thr Val Val Val Ala Pro Arg Arg Ser He Asp Ala Thr 
4910 4915 4920 

Val Leu Arg Asp Leu He Gly Ala His Glu Leu Thr His Val His 
4925 4930 4935 
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Val Thr Ala Gly Leu Leu Arg Val Leu Asp Pro Ser Cys Phe Ala 
4940 4945 4950 



Gly Leu Thr Glu Val Leu Thr Gly Gly Asp Ala Val Ser Ala Glu 
4955 4960 4965 

Ala Val Arg Arg Val Lys Asp Ala Asn Pro Gly Leu Arg Val Arg 
4970 4975 4980 

Gin Leu Tyr Gly Pro Thr Glu Val Thr Leu Cys Ala Thr Gin His 
4985 4990 4995 

Leu Leu Asp Asp Gly Val Pro lie Gly Arg Pro Leu Asp Asn Thr 
5000 5005 5010 

Arg Val Tyr Val Leu Asp Asp Leu Leu Arg Pro Val Pro Thr Gly 
5015 5020 5025 

Val Val Gly Glu Leu Tyr Val Ala Gly Ser Gly Leu Ala Arg Gly 
5030 5035 5040 

Tyr Ala Gly Met Pro Gly Leu Thr Ala Glu Arg Phe Val Ala Asp 
5045 5050 5055 

Pro Phe Asn Thr Gly Gly Arg Leu Tyr Arg Thr Gly Asp Leu Val 
5060 5065 5070 

Arg Trp Ala Asp Asp Gly Val Leu His Phe Ala Gly Arg Ala Asp 
5075 5080 5085 

Asp Gin Val Lys lie Arg Gly Tyr Arg Val Glu Pro Gly Glu Val 
5090 5095 5100 

Glu Ala Val Leu Ala Gin His Pro Asp Val Ser Gin Val Ala Val 
5105 5110 5115 

Val Val Arg Glu Asp Thr Pro Gly Asp Lys Arg Leu Val Ala Tyr 
5120 5125 5130 

Val Val Gly Gly Asp Val Glu Ala Tyr Ala Gin Glu Arg Leu Pro 
5135 5140 5145 

Gly Tyr Met Val Pro Ser Ala Phe Val Gin Leu Asp Ala Leu Pro 
5150 5155 5160 

Leu Thr Ser Asn Gin Lys Val Asp Arg Ala Ala Leu Pro Ala Pro 
5165 5170 5175 

Ser Met Glu Ser Gly Ala Gly Arg Ala Pro Ala Asp Ala Arg Glu 
5180 5185 5190 

Glu Leu Met Cys Ala Ala Phe Ala Glu Val Leu Asp Leu Asp Arg 
5195 5200 5205 

Val Gly Val Asp Asp Asp Phe Phe Ala Leu Gly Gly His Ser Leu 
5210 5215 5220 

Leu Ala Val Ser Leu Val Glu Asn Leu Arg Arg His Gly Val His 
5225 5230 5235 
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lie Ser Val Arg Ala Leu Phe Ala Thr Pro Thr Pro Ala Ala Leu 
5240 5245 5250 

Ala Ala Ser Ala Gly Thr Ala Val Pro Asp Val Pro Pro Asn Leu 
5255 5260 5265 

lie Pro Gin Gly Gly Ala Gin Glu Leu Thr Pro Asp Met Leu Pro 
5270 5275 5280 

Leu Val Asp Leu Thr Gly Glu Glu Leu Ala Thr lie Val Ala Ala 
5285 5290 5295 

Val Pro Gly Gly Ala Pro Asn lie Ala Asp lie Tyr Pro Leu Ala 
5300 5305 5310 

Pro Leu Gin Glu Gly lie Phe Phe His His Leu Met Thr Glu Gly 
5315 5320 5325 

Asp Ala Thr Asp Val Tyr Leu Leu Pro Arg lie Leu Gly Phe Gly 
5330 5335 5340 

Gly Arg Pro Glu Leu Asp Ala Phe Leu Gly Ala Leu Gin Gin Val 
5345 5350 5355 

Val Asp Arg His Asp Val Tyr Arg Thr Ala lie Ala Trp Gin Asn 
5360 5365 5370 

Leu Arg Glu Pro Val Gin Val Val His Arg His Ala Thr Leu Pro 
5375 5380 5385 

Val Thr Glu Val Thr Pro Asp Gin Leu His Ala Ala Ala Thr Gly 
5390 5395 5400 

Gly Arg Leu Pro Leu Asp His Ala Pro Leu Leu Ser Val His lie 
5405 5410 5415 

Ala Pro Glu Pro Asp Gly Gly Trp Leu Ala Leu Leu Arg Met His 
5420 5425 5430 

His Leu Val Gin Asp His Thr Ala Leu Asp lie Val Leu Asp Glu 
5435 5440 5445 

lie Arg Thr lie Leu Ala Gly Ala Thr Asp His Leu Pro Pro Pro 
5450 * 5455 5460 

Val Pro Phe Arg Asp Phe Val Ala Gin Ala Arg Leu Gly Val Ser 
5465 5470 5475 

Arg Ala Glu Gin Glu Arg Tyr Phe Ala Gly Leu Leu Gly Asp Val 
5480 5485 5490 

Thr Glu Thr Thr Ala Pro Tyr Gly Leu Ala Asp Val Thr Asn Asp 
5495 5500 5505 

Gly Thr Ala Ser Val Arg Ala Glu Val Glu Leu Asp Ala Ala Leu 
5510 5515 5520 

Ala Ala Arg Leu Arg Asp Leu Ala Arg Asp Arg Gly Val Ser Pro 
5525 5530 5535 
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Ala Thr Val Phe His Leu Ala Trp Ala Arg Val Leu Ala Ala Val 
5540 5545 5550 

Ala Asp Arg Glu Asp Val Val Phe Gly Thr Val Leu Phe Gly Arg 
5555 5560 5565 

Met Ala Ser Gly Ala Arg Arg Val Pro Gly Leu Phe Met Asn Thr 
5570 5575 5580 

Leu Pro Val Arg Val Arg Leu Ser Gly Thr Ala Ala Glu Ala Leu 
5585 5590 5595 

Gly Gin Val Arg Asp Arg Leu Ala Glu Leu Met Ala His Glu His 
5600 5605 5610 

Ala Pro Leu Ala Leu Ala Gin Gin Ala Ser Gly Leu Pro Ala Gly 
5615 5620 5625 

Ser Pro Leu Phe Thr Ser Leu Phe Asn Tyr Arg Tyr Ala Arg Pro 
5630 5635 5640 

Pro Ala Ala Thr Pro Asp Asp Pro Leu Ala Gly Val Arg Thr Leu 
5645 5650 5655 

Phe Ala Trp Glu Arg Asn Asn Tyr Pro Val Thr Val Ser lie Asp 
5660 5665 5670 

Asp Asp Gly Thr Gly Phe Ala Val Thr Val Asp Val Val Ala Pro 
5675 5680 5685 

Ala Asp Ala Asp Glu Val Val Arg Leu Leu Arg Thr Thr Leu Thr 
5690 5695 5700 

Arg Leu Ala Ala Ala Leu Glu Arg Thr Pro Glu Met Pro Val Ala 
5705 5710 5715 

Asp Val Arg Pro Gly Arg Val Ser Arg Pro Ala Ala Gly Arg Ala 
5720 5725 5730 

Val Leu Val Pro Val Pro Ala Gly Glu Arg Ala Thr Gly Ala Gly 
5735 5740 5745 

Arg Ala Pro Ala Thr Ala Tyr Glu Glu Leu lie Cys Gin Ala Tyr 
5750 5755 5760 

Ala Gin Val Leu Glu Val Asp Arg Val Ala Ala Asp Asp Asp Phe 
5765 5770 5775 

Phe Ala Leu Gly Gly Asn Ser Leu Leu Ala Thr Arg Leu Val Ser 
5780 5785 5790 

Arg lie Arg Ser Ala Leu Gly Val Glu Val Thr lie Arg Ala Leu 
5795 5800 5805 

Phe Glu Thr Leu Thr Pro Gin Arg Leu Ala Ala Arg Leu Thr Arg 
5810 5815 5820 

Ala Ser Ala Pro Gly Arg Val Ala Pro Ala Pro Arg Thr Arg Pro 
5825 5830 5835 
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Glu Arg lie Pro Leu Ser Phe Ala Gin Arg Arg Leu Trp Phe Leu 
5840 5845 5850 

Gly Glu Leu Glu Gly Ser Ser Ala Thr Tyr Ser Asn Thr Thr Ala 
5855 5860 5865 

Leu Arg Leu Ser Gly Glu Leu Asp Pro Ala Ala Leu Thr Ala Ala 
5870 5875 5880 

Leu His Asp Val lie Gly Arg His Glu Val Leu Arg Thr Val lie 
5885 5890 5895 

Pro Ala Glu Asp Gly Arg Pro Tyr Gin Leu Val Leu Pro Pro Glu 
5900 5905 5910 

Glu Ala Arg Pro Ala Val Glu lie Val Glu Val Ala Pro Gly Glu 
5915 5920 5925 

Leu Gly Ala Ala Val Asp Glu Val Ala Gly Tyr Ala Phe Asp Leu 
5930 5935 5940 

Ala Ala Glu lie Pro Val Arg Ala Arg Leu lie Arg Leu Gly Ala 
5945 5950 5955 

Thr Asp His Val Leu Val Leu Val lie His His lie Ala Thr Asp 
5960 5965 5970 

Gly Trp Ser Met Ala Pro Leu Ala Arg Asp Leu Ala Ala Ala Tyr 
5975 5980 5985 

Glu Ala Arg Leu Ala Gly Arg Ala Pro Arg Trp Glu Pro Leu Pro 
5990 5995 6000 

Leu Gin Tyr Ala Asp Tyr Ala Leu Trp Gin Glu Glu Leu Leu Gly 
6005 6010 6015 

Ala Ala Gly Asp Pro Glu Ser Leu Arg Glu Arg Gin Leu Ala Tyr 
6020 6025 6030 

Trp Arg Asp Thr Leu Ala Gly Met Pro Pro Glu lie Pro Leu Pro 
6035 6040 6045 

Ala Asp Arg Ser Arg Pro Pro Val Ala Ser His Arg Gly Gly Glu 
6050 6055 6060 

Val Pro lie Ala lie Pro Ala Asp Leu His Arg Arg Leu Ala Glu 
6065 6070 6075 

Leu Ala Val Ala Glu Arg Ala Thr Leu Phe Met Val Leu Gin Ala 
6080 6085 6090 

Gly Phe Ala Ala Leu Leu Ser Arg Leu Gly Ala Gly Thr Asp Val 
6095 6100 6105 

Pro lie Gly Thr Ala Leu Ala Gly Arg Thr Asp Asp Ala Leu Asp 
6110 6115 6120 

Glu Leu Val Gly Phe Phe Val Asn Met Leu Val Leu Arg Thr Asp 
6125 6130 6135 
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Val Ser Gly Asp Pro Gly Phe Gly Thr Leu Leu Arg Arg Val Arg 

6140 6145 6150 

Glu Thr Gly Leu Ala Ala Tyr Ala His Gin Asp Val Pro Phe Asp 

6155 6160 6165 

Gin Val Val Glu Glu Leu Val Thr Glu Arg Ser Leu Ala Arg His 

6170 6175 6180 

Pro Leu Phe Gin Val Ala Leu Thr Val Gin Asn Ala Pro Gly Ala 

6185 6190 6195 

Arg Pro Arg Leu Ala Gly Leu Glu Val Gly Thr Glu Pro lie Glu 

6200 6205 6210 

His Gly lie Ala Arg Tyr Asp Leu Thr Leu Thr Val Thr Glu Arg 

6215 6220 6225 

Arg Asp Glu. His Gly Ala Pro Asp Gly Leu Glu Gly His Leu Glu 

6230 6235 6240 

Phe Ser Arg Asp Leu Phe Asp Ala Pro Thr Val Ala Thr Leu Gly 

6245 6250 6255 

Asp Arg Leu lie Arg Leu Leu Thr Ala Ala Val Ala Asp Pro Glu 

6260 6265 6270 

Leu Pro Leu Ser Arg lie Asp Leu Met Ala Pro Ala Glu Arg Arg 

6275 6280 6285 

Asn Val Leu Glu Gly Trp Ser Thr Ala Arg Arg Asp Val Pro Ala 

6290 6295 6300 

Ala Thr Val Pro Glu Leu Val Ala Ala Gin Val Ala Arg Arg Pro 

6305 6310 6315 

Gly Ala Val Ala Leu Arg Ser Glu Asp Gly Glu lie Thr Tyr Ala 

6320 6325 6330 

Glu Leu Asp Ala Arg Ala Gly Arg Leu Ala Ala Val Leu Arg Arg 

6335 6340 6345 

Arg Gly lie Gly Pro Glu Ser Arg Val Ala Val Leu Leu Pro Arg 

6350 6355 6360 

Gly Val Glu Gin Val Val Ala Phe Leu Ala Val Val Arg Ala Gly 

6365 6370 6375 

Gly Thr Tyr Leu Pro lie Asp Pro Ala Tyr Pro Arg Asp Arg Val 

6380 6385 6390 

Asp Tyr Leu Val Arg Asp Ala Glu Pro Ala Cys Leu Leu Thr Val 

6395 6400 6405 

Ala Gly His Arg Ala Ala Ala Pro Ala Ala Pro Ala Val Val Glu 

6410 6415 6420 

Leu Asp Asp Pro Ala Thr Ala Ala Glu lie Ala Asp Ala Glu Pro 

6425 6430 6435 
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Glu Pro Pro Val Ala Val Arg Pro Thr His Ser Ala Tyr Leu lie 
6440 6445 6450 

Tyr Thr Ser Gly Ser Thr Gly Arg Pro Lys Gly Val Val Val Thr 
6455 6460 6465 

His Arg Gly Val Ala Ala Leu Val Ala Thr Gin Ala Glu Arg Leu 
6470 6475 6480 

Ala Val Thr Gly Glu Ser Arg Val Leu Gin Phe Ala Ser Val Gly 
6485 6490 6495 

Phe Asp Ala Ser lie Trp Glu Met Val Met Ala Leu Cys Ala Gly 
6500 6505 6510 

Ala Thr Leu Val Val Ala Pro Ala Asp Asp Leu Leu Pro Gly Pro 
6515 6520 6525 

Ala Leu Ala Ala Thr Leu Ser Gly His Ala Val Thr His Ala Thr 
6530 6535 6540 

Leu Pro Pro Ala Val Leu Ala Ala Ser Ala Pro Gly Asp Leu Ala 
6545 6550 6555 

Pro Leu Ala Val Leu Val Ser Ala Gly Glu Ala Leu Gly Pro Asp 
6560 6565 6570 

Leu Val Arg Gin Phe Ala Pro Gly Arg Ala Leu Val Asn Ala Tyr 
6575 6580 6585 

Gly Pro Thr Glu Thr Thr Val Cys Ala Thr Ala Ser Ala Pro Leu 
6590 6595 6600 

Gly Pro Glu Asp Pro Pro His lie Gly Ala Pro Val Ala Asp Ser 
6605 6610 6615 

Arg Val Tyr Val Leu Asp Asp Ala Leu Thr Pro Val Pro Pro Gly 
6620 6625 6630 

Val Thr Gly Glu Leu Tyr Val Ser Gly Ala Ser Leu Ala Arg Gly 
6635 6640 6645 

Tyr Ala Gly Arg Ala Ala Leu Thr Ala Glu Arg Phe Val Ala Cys 
6650 6655 6660 

Pro Phe Ala Pro Gly Glu Arg Met Tyr Arg Thr Gly Asp Arg Ala 
6665 6670 6675 

Arg Trp Asp Ala Ala Gly Arg Leu Thr -Phe Ala Gly Arg Ala Asp 
6680 6685 6690 

Asp Gin Val Lys lie Arg Gly Phe Arg Val Glu Pro Gly Glu Val 
6695 6700 6705 

Ala Ala Val Leu Gly Glu His Pro Ala Val Ala Arg Ala Ala Val 
6710 6715 6720 

Val Ala Arg Thr Asp Gly Pro Gin Gly Ala Arg Leu Val Ala Tyr 
6725 6730 6735 
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Leu Val Ala Ala Asp Pro Ala Gly Pro Asp Leu Ala Ala Ala Val 
6740 6745 6750 



Arg Ala Tyr Ala Ala Ala Thr Leu Pro Ala His Leu Leu Pro Ala 
6755 6760 6765 

Ala Phe Val Pro Leu Asp Arg Leu Pro Leu Thr Thr Asn Gly Lys 
6770 6775 6780 

Leu Asp Arg Ala Ala Leu Pro Glu Pro Glu Thr Gly Ala Gly Arg 
6785 6790 6795 

Glu Pro Ser Gly Pro Val Glu Arg Leu Leu Cys Glu Ala Phe Ala 
6800 6805 6810 

Asp Val Leu Gly Leu Asp Arg Val Gly Ala Asp Gly His Phe Phe 
6815 6820 6825 

Asp Leu Gly Gly His Ser Leu Leu Ala Thr Arg Leu Leu Ser Arg 
6830 6835 6840 

Leu Arg Ser Ala Ala Gly lie Asp Val Pro Val Arg Val Leu Phe 
6845 6850 6855 

Glu Asn Pro Thr Pro Ala Gly Leu Ala Ala Trp Val Glu Thr His 
6860 6865 6870 

Ala Gly Ser Arg Arg Lys Ser' Arg Pro Ala Leu Arg Pro Met Arg 
6875 6880 6885 

His Gin Lys Glu Ser 
6890 

<210> 15 
<211> 8695 
<212> PRT 

<213> Actinoplanes sp. 
<400> 15 

Met lie Pro Leu Ser Phe Ala Gin Arg Arg Leu Trp Phe Leu Gly Arg 
15 10 15 

Leu Glu Gly Pro Ser Ala Thr Tyr Asn lie Pro Leu Val Leu Gly Leu 
20 25 30 

Thr Gly Thr Val Asp Ala Ala Ala Leu Glu Thr Ala Leu Arg Asp Val 
35 40 45 

Leu Glu Arg His Glu Val Leu Arg Thr Val Tyr Pro Asp Ala Gly Gly 
50 55 60 

Glu Pro His Gin Arg lie Leu Pro Leu Gly Glu Thr Gly Phe Gly Leu 
65 70 75 80 

Arg Val Ala Glu Val Thr Asp Gly Glu Leu Asp Ala Ala Val Ala Asp 
85 90 95 

Ala Thr Gly His Ala Phe Asp Leu Ala Thr Glu lie Pro Val Arg Ala 
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100 



105 



110 



Ser Leu Leu Thr Val Glu Pro Gly Arg His Val Leu Ala Leu Val Leu 
115 120 125 

His His lie Ala Ala Asp Gly Trp Ser Met Gly Pro Leu Leu Arg Asp 
130 135 140 

Leu Ser Thr Ala Tyr Thr Ala Arg Leu Ala Gly Gly Glu Pro Ala Trp 
145 150 155 160 

Ser Pro Leu Pro Val Gin Tyr Ala Asp Tyr Ala Leu Trp Gin Gin Glu 
165 170 175 

Val Leu Gly Ala Gly Asp Asp Pro Glu Ser Leu Leu Arg Glu Gin Val 
180 185 190 

Gly Tyr Trp Arg Ser Ala Leu Ala Gly Ala Pro Glu Glu Leu Arg Leu 
195 200 205 

Pro Ala Asp His Arg Arg Pro Pro Val Ser Ser Ser Arg Ala His Met 
210 215 220 

Ala Glu Phe Ala Val Pro Ala Ala Ala His Gly Asp Leu Thr Ala Leu 
225 230 235 240 

Thr Arg Glu Leu Gly Ala Thr Leu Phe Met Ala Val His Ala Ala Thr 
245 250 255 

Ala Met Val Leu Ser Gly Leu Gly Ala Gly Asp Asp Leu Pro lie Gly 
260 265 270 

Thr Val Val Ala Gly Arg Thr Asp Ala Gly Leu Asp Asp Leu Val Gly 
275 280 285 

Cys Phe Val Asn Asn Leu Val lie Arg Ala Asp Leu Thr Gly Asp Pro 
290 295 300 

Thr Phe Ala Asp Leu Leu Arg Gin Val Arg Glu Arg Ala Leu Asp Ala 
305 310 315 320 

Tyr Gly His Gin Asp Val Pro Phe Glu Lys Leu Val Glu Glu Leu Ala 
325 330 335 

Pro Ser Arg Ser Leu Ser Arg His Pro Leu Phe Gin Val Ala Val Ala 
340 345 350 

Val Glu Thr Asp Asp Leu lie Gly Gly Arg Gly Gly Gly Pro Ala Leu 
355 360 365 

Arg Leu Pro Gly Leu Gly lie Glu Val Leu Pro Gly Glu Pro Ser Ala 
370 375 380 

Arg Asp Leu Asp Leu Asp Leu Val Val Arg Glu Thr Phe Asp Ala Glu 
385 390 395 400 

Gly Arg Pro Ala Gly Leu Thr Gly Ala Leu lie Gly Ala Ala Gly Leu 
405 410 415 



Phe Asp Ala Ala Ser Val Glu Arg Leu Ala Ala Leu Leu Ala Arg Ala 
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420 



425 



430 



Leu Glu Ala Leu Ala Ala Asp Pro Arg Thr Arg Ala Gly Asp Leu Asp 
435 440 445 

Leu Leu Ser Pro Ala Asp Arg Arg Leu lie Leu Arg Gly Trp Asn Asp 



Thr Ala Ala Pro Ala Pro Ala Gly Leu Val Pro Asp Leu Phe Ala Ala 
465 470 475 480 

Gin Ala Ala Arg Thr Pro Asp Ala Val Ala Val Ala Gly Pro Asp Arg 
485 490 495 

Glu Leu Thr Tyr Ala Glu Leu Asp Glu Arg Ser Gly Arg Leu Ala Arg 
500 505 510 

Trp Leu lie Arg Arg Gly Val Ala Ala Asp Thr Arg Val Ala Leu Val 
515 520 525 

Leu Glu Arg Ser Ala Glu Leu Pro Val Ala lie Leu Ala Val Leu Lys 
530 535 540 

Ala Gly Gly Ala Tyr Leu Pro lie Asp Pro Ala Gin Pro Pro Arg Arg 
545 550 555 560 

lie Ala Asp lie Val Ala Asp Ala Ala Pro Ala Leu Val Leu Ala Gin 
565 570 575 

Ala Ser Thr Ala Asp Val Val Ala Asp Ala Ser Pro Ala Leu Val Leu 
580 585 590 

Ala Pro Ala Ser Asp Gly Val Pro Thr Gly Ala Val Pro Val His Leu 
595 600 605 

Leu Asp Ser Pro Ala Val Arg Asp Glu Val Ala Gin Cys Pro Ala Gly 
610 615 620 

Ala Val Thr Asp Ala Asp Arg Arg Gly Val Leu Leu Gly Gly His Ala 
625 630 635 640 

Ala Tyr Val lie Tyr Thr Ser Gly Ser Thr Gly Arg Pro Lys Gly Val 
645 650 655 

Val Val Ser His Asp Ala Phe Ala Asn Leu Val Leu Asp Glri Arg Arg 
660 665 670 

Leu Gly lie Gly Pro Gly Ser Arg Val Ala Gin Phe Ala Ser Pro Gly 
675 680 685 

Phe Asp Met Phe Val Asp Glu Trp Ser Met Ala Leu Leu Ala Gly Ala 
690 695 700 

Ala Leu Val lie Val Pro Pro Glu Arg Arg Leu Gly Ala Asp Leu Ala 
705 710 715 720 

Ala Phe Leu Thr Glu Arg Gly Val Thr His Ala Thr Leu Pro Pro Ala 



450 



455 



460 



725 



730 



735 



Val Val Ala Thr Leu Pro Glu Glu Ser Leu Pro Arg Ser Phe Val Leu 
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740 



745 



750 



Asp lie -Gly Gly Asp Ala Leu Pro Asp Asp Leu Ala Arg Arg Trp Leu 
755 760 765 

Arg Asp Gly Arg Trp Leu Gly Asn Ser Tyr Gly Pro Thr Glu Thr Thr 
770 775 780 

Val Asn Ala Ala Thr Trp Arg Cys Glu Pro Gly Thr Trp Glu Gly Ala 
785 790 795 800 

Thr Pro lie Gly Arg Pro Val Ala Asn Leu Arg Ala Tyr Val Leu Asp 
805 810 815 

Gly Arg Leu Arg Pro Val Pro Val Gly Val Glu Gly Glu Leu Tyr Val 
820 825 830 

Ser Gly Ala Gly Leu Ala Arg Gly Tyr Leu Asn Arg Ala Gly Leu Thr 
835 840 845 

Ala Gly Ser Phe Val Ala Cys Pro Phe Glu Pro Gly Glu Arg Met Tyr 
850 855 860 

Arg Thr Gly Asp lie Val Arg Trp Asp Ala Arg Gly Arg Leu Val Tyr 
865 870 875 880 

Ala Gly Arg Ala Asp Asp Gin Ala Lys lie Arg Gly Phe Arg Val Glu 
885 890 895 

Pro Gly Glu Val Glu Ala Val Leu Ala Ala Gly Pro Gly Val Asn Gin 
900 905 910 

Val Ala Val lie Val Arg Glu Asp Val Pro Gly Asp Lys Arg Leu Val 
915 920 925 

Ala Tyr Val Val Gly Gly Asp Val Glu Thr Leu Arg Ser Tyr Ala Gin 
930 935 940 

Gin Arg Leu Pro Gly Tyr Leu Val Pro Ser Ala lie Val Ala Leu Ala 
945 950 955 960 

Glu Leu Pro Leu Thr Pro Ser Ala Lys Val Asp Arg Arg Ala Leu Pro 
965 970 975 

Val Pro Asp Tyr Gly Arg Asp Ala Gly Gly Gly Arg Ala Pro Ala Asn 
980 985 990 

Ala Arg Glu Glu Val Leu Cys Arg Ala Phe Ala Glu Val Leu Gly Val 
995 1000 1005 

Glu Arg Val Gly Val Glu Asp Asp Phe Phe Ala Leu Gly Gly His 
1010 1015 1020 

Ser Leu Leu Val Val Ser Leu Val Glu Arg Leu Arg Arg Gin Gly 
1025 1030 1035 

lie Ser Val Pro Val Arg Ala Leu Phe Thr Thr Pro Thr Pro Ala 
1040 1045 1050 



Gly Leu Ala Glu Ala Val Gly Asp Gly Ala 
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Val 



Val Val Pro Pro 



1055 



1060 



1065 



Asn Leu lie Pro Glu Gly Ala Ala Glu Leu Thr Pro Glu Met Leu 
1070 1075 1080 

Pro Leu Ala Asp Leu Thr Ala Asp Glu Leu Ala Val Val Val Asp 
1085 1090 1095 

Ser Val Pro Gly Gly Ala Ala Asn He Ala Asp Val Tyr Pro Leu 
1100 1105 1110 

Ala Pro Leu Gin Glu Gly He . Phe Phe His His Met Met Ala Asp 
1115 1120 1125 

Arg Asp Ser Ala Asp Val Tyr Val Thr Pro Thr Val Val Glu Phe 
1130 1135 1140 

Asp Ser Arg Asp Arg Leu Asp Gly Phe Leu Ala Ala Leu Gin Gin 
1145 1150 1155 

Val Val Asp Arg Thr Asp Val Tyr Arg Thr Ser Val Val Trp Gin 
1160 1165 1170 

Gly Leu Arg Glu Pro Val Gin Val Val Trp Arg His Ala Arg Leu 
1175 1180 1185 

Pro Val Asp Glu Val Val Leu Arg Asp Asp Leu Asp Pro Val Glu 
1190 1195 1200 

Gin Leu Asn Ala Leu Gly Thr Ala Trp Met Asp Leu Ser Glu Ala 
1205 1210 1215 

Pro Leu Val Gin Ala Val Val Ala Ala Arg Pro Gly Asp Pro Gin 
1220 1225 1230 

Arg Trp ' Leu Ala Val Leu Arg He His His Leu Val Gin Asp His 
1235 1240 1245 

Thr Ala Leu Asp He Leu Leu Glu Glu Leu Ala Ala Tyr Leu Ala 
1250 1255 1260 

Gly Arg Gly Gly Asp Leu Pro Glu Pro Val Pro Phe Arg Glu Phe 
1265 1270 1275 

Val Ala His Thr Arg Leu Gly Val Pro Arg Glu Glu His Glu Arg 
1280 1285 1290 

Tyr Phe Ala Gly Leu Leu Gly Asp Val Thr Glu Thr Thr Ala Pro 
1295 1300 1305 

Tyr Gly Leu Leu Asp Val His Ser Gly Gly Leu Ala Ser Ala Gin 
1310 1315 1320 

Ala His Leu Arg Leu Asp Gly Pro Leu Gly Arg Arg Val Ala Ala 
1325 - 1330 1335 

Phe Ala Arg Glu His Gly Val Ser Pro Ala Thr Leu Phe His Leu 
1340 1345 1350 



Ala Trp Ala Arg Val Leu Gly Thr Leu Ala Gly Arg Asp Asp Val 

98 



1355 



1360 



1365 



Val Phe Gly Thr Val Leu Phe Gly Arg Met Asn Ser Gly Ala Gly 
1370 1375 1380 

Ala Asp Arg Val Pro Gly Leu Phe lie Asn Thr Leu Pro Val Arg 
1385 1390 1395 

Val Arg Leu Gly Ala Pro Val Gly Asp Ala Leu Asp Gly Leu Arg 
1400 1405 1410 

Asp Gin Leu lie Glu Leu lie Ala His Glu His Ala Pro Leu Ala 
1415 1420 1425 

Val Ala Gin Gin Ala Ala Asn Leu Phe Gly Arg Pro Leu Phe Thr 
1430 1435 1440 

Ser lie Phe Asn Tyr Arg Tyr Ala Arg Gly Ala Glu Pro Ala Gly 
1445 1450 1455 

Ala Ala Leu Asp Gly lie Arg Leu Leu Ser Ala Arg Asp Leu Thr 
1460 1465 1470 

Asn Tyr Pro Leu Ala Val Ala Val Asp Ala Glu Gly Asp Thr Phe 
1475 1480 1485 

Ser Leu Thr Val Asp Ala Val Ala Pro Ala Asp Pro Val Gin Val 
1490 1495 1500 

Gly Glu Leu Leu Val Thr Ala Leu Arg Asn Leu Thr Arg Thr Ala 
1505 1510 1515 

Glu Asn Ala Pro Gly Thr Pro Leu Ala Ala Val Gly Val Leu Gly 
1520 1525 1530 

Glu Asp Glu Leu Ser Arg Val Val Ser Gly Trp Asn Asp Thr Ala 
1535 1540 1545 

Arg Arg Val Arg Gin Ala Ser Val Pro Glu Leu Phe Ala Glu Arg 
1550 1555 ' 1560 

Val Ala Ala Ala Pro Gly Ala Pro Ala Val Ala Ala Gly Asp Leu 
1565 1570 1575 

Arg Trp Thr Tyr Ala Asp Leu Asp Ala Arg Ser Asp Ala Leu Ala 
1580 1585 1590 

Arg Ser Leu Val Ala Ala Gly Val Thr Ala Glu Ser Pro Val Val 
1595 1600 1605 

Val Ala Leu Glu Arg Ser Ala Asp Val Leu Thr Ala Phe Leu Ala 
1610 1615 1620 

Val Ala Lys Ala Gly Gly Val Phe Val Pro Val Asp Leu Ser Trp 
1625 1630 1635 

Pro Arg Ala Arg Val Asp Ala Val lie Ala Asp Cys Ala Ala Trp 
1640 1645 ' 1650 



lie Ala Val Ala Asp Arg Pro Met Thr Gly Leu Thr Val Val Pro 
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1655 



1660 



1665 



Ala Asn Arg Ala Gly Asp Pro Ala Val Ala Leu Pro Pro Arg Pro 
1670 1675 1680 

Leu Pro Gly Ala Ala Ala Tyr Arg Met Tyr Thr Ser Gly Ser Thr 
1685 1690 1695 

Gly Arg Pro Lys Gly Val Val Thr Thr His Gin Asn Val Val Asp 
1700 1705 1710 

Leu Val Thr Asp Arg Cys Trp Gly Pro Thr Pro Arg Val Leu Phe 
1715 1720 1725 

His Ala Pro His Ala Phe Asp Ala Ser Ser Phe Glu Leu Trp Val 
1730 1735 1740 

Pro Leu Leu Thr Gly Gly Thr Val Val Val Ala Pro Gly Glu Ser 
1745 1750 1755 

lie Asp Thr Gly Val Leu Arg Gin Leu lie Arg Ala His Glu Leu 
1760 1765 1770 

Thr His Val His Val Thr Ala Gly Leu Leu Arg Val Leu Ala Glu 
1775 1780 1785 

Asp Pro Ser Cys Phe Ala Gly Leu Thr Glu Val Leu Thr Gly Gly 
1790 1795 1800 

Asp Val Val Pro Ala Glu Ala Val Arg Arg Val Leu Asp Ala Asn 
1805 1810 1815 

Pro Gly Val Arg Val Arg Gin Leu Tyr Gly Pro Thr Glu Val Thr 
1820 1825 1830 

Leu Cys Ala Thr Gin His Val Val Arg Glu Pro Ser Pro Val Leu 
1835 1840 1845 

Pro lie Gly Arg Pro Leu Asp Asn Thr Arg Val Tyr Val Leu Asp 
1850 1855 1860 

Gly Leu Leu Gin Pro Val Pro Val Gly Val Thr Gly Glu Leu Tyr 
1865 1870 1875 

lie Ala Gly Ala Gly Val Ala Arg Gly Tyr Ala Asp Met Pro Gly 
1880 1885 1890 

Thr Thr Ala Glu Arg Phe Val Ala Asp Pro Phe Thr Ala Gly Gly 
1895 1900 1905 

Arg Leu Tyr Arg Thr Gly Asp Leu Val Arg Trp Thr Gly Glu Gly 
1910 1915 1920 

Glu Leu Val Phe Ala Gly Arg Ala Asp Asp Gin Val Lys lie Arg 
1925 1930 1935 

Gly Tyr Arg Val Glu Pro Gly Glu Val Glu Ala Val Leu Ala Ala 
1940 1945 1950 



Leu Pro Gly Val Ser Gin Ala Ala Val lie Val Arg Glu Asp Val 
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1955 



1960 



1965 



Pro Gly -Asp Lys Arg Leu Val Ala Tyr Leu Val Ala Ala Pro Glu 
1970 1975 1980 

Thr Val Glu Ala Ala Arg Ala His Ala Glu Gin Arg Leu Pro Ser 
1985 1990 1995 

Tyr Leu Val Pro Ser Ala Phe Val Gin Leu Asp Ala Leu Pro Leu 
2000 2005 2010 

Thr Gly Asn Gin Lys Val Asp Arg Ala Ala Leu Pro Ala Pro Leu 
2015 2020 2025 

Gly Phe Glu Ala Gly Ala Gly Arg Ala Pro Ala Asp Ala Arg Glu 
2030 2035 2040 

Glu Leu Val Gly Ala Ala Phe Ala Glu Val Leu Asp Leu Gly Arg 
2045 2050 2055 

Val Gly Pro Asp Asp Asp Phe Phe Ala Leu Gly Gly His Ser Leu 
2060 2065 2070 

Leu Ala Leu Ala Leu Val Glu Arg Leu Arg Arg Gin Gly Leu Gly 
2075 2080 2085 

Val Ser Val Arg Ala Val Phe Asp Ala Arg Thr Pro Ala Ala Leu 
2090 2095 2100 

Thr Arg Arg Gly Asp Gly Gly Ala Asp Asp Arg Pro Ala Leu Arg 
2105 2110 2115 

Ala Gly Ala Arg Pro Ala Arg Leu Pro Leu Ser Tyr Ala Gin Arg 
2120 2125 2130 

Arg Leu Trp Phe Leu Ala Gin Leu Glu Gly Pro Ser Ala Thr Tyr 
2135 2140 2145 

Asn lie Pro Val Ala Leu Arg Leu Glu Gly Asp Leu Asp Arg Asp 
2150 2155 2160 

Ala Leu Thr Ala Ala Leu Arg Asp Val Val Ala Arg His Glu Val 
2165 * 2170 ' 2175 

Leu Arg Thr Val Phe Thr Val Ala Asp Gly Glu Pro Trp Gin His 
2180 2185 2190 

lie Leu Asp Pro Ala Arg Ala Glu Pro Ala Leu Pro Val Val Asp 
2195 2200 2205 

Val Pro Ala Gly Arg Val Glu Glu Ala Val Ala Glu Ala Ala Ala 
2210 2215 2220 

Tyr Ala Phe Asp Leu Ala Arg Glu lie Pro Leu Arg Ala Val Leu 
2225 2230 2235 

Leu Ala Pro Gly Asp Gly Thr His Val Leu Val Leu Val Leu His 
2240 2245 2250 



His lie Ala Ala Asp Gly Trp Ser Met Arg Pro Leu Ala Arg Asp 
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2255 



2260 



2265 



Leu Ala Thr Ala Tyr Ala Ala Arg Arg Arg Gly Gin Ala Pro Glu 
2270 2275 2280 

Ser Glu Thr Leu Pro Val Gin Tyr Ala Asp Tyr Ala Leu Trp Gin 
2285 2290 2295 

Arg Asp Leu Leu Gly Ser Asp Ser Asp Pro Ala Ser Leu lie Ser 
2300 2305 2310 

Arg Gin lie Ala His Trp Arg Glu Arg Leu Asp Gly Val Pro Glu 
2315 2320 2325 

Glu Leu Asp Leu Pro Ala Asp Arg Pro Arg Pro Ala Ala Ala Ser 
2330 2335 2340 

His Arg Gly His Leu His Ser Ala Glu lie Pro Ala Asp Val His 
2345 2350 2355 

Arg Ser Leu Arg Arg Val Ala Ala Asp His Gly Ala Thr Val Phe 
2360 2365 2370 

Met Thr Leu Gin Ala Ala Val Ala Val Leu Leu Ser Arg Leu Gly 
2375 2380 2385 

Ala Gly Thr Asp Val Pro He Gly Thr Val Val Ala Gly Arg Ala 
2390 2395 2400 

Asp Arg Ala Leu Glu Asn Leu Val Gly Phe Phe Val Asn Thr Leu 
2405 2410 2415 

Val Leu Arg Thr Asp Leu Thr Gly Asp Pro Arg Leu Thr Asp Val 
2420 2425 2430 

Leu Gly Gin Val Arg Glu Leu Thr Leu Arg Ala Leu Ala His Gin 
2435 2440 2445 

Asp Val Pro Phe Glu Lys Leu Val Glu Glu Leu Thr Pro Ala Arg 
2450 2455 2460 

Ser Leu Ala Arg His Pro Leu Phe Gin Val Met Val Thr Leu Asp 
2465 2470 2475 

Gly Gly Gly Pro Asp Gly Ala Glu Leu Pro Gly Leu Ala Met Ser 
2480 2485 2490 

Val Val Pro Thr Gly Ala Val Pro Ala Lys Phe Asp Leu Asp Leu 
2495 2500 2505 

Thr Phe Thr Glu Thr Phe Asp Ala Ala Gly Glu Pro Ala Gly Leu 
2510 2515 2520 

Arg Val Asp Leu He Ala Ala Ala Asp Leu Phe Asp Ala Gly Thr 
2525 2530 2535 

Ala Ala Arg Leu Ala Gly Tyr Leu Ser Arg Val Leu Gly Val Leu 
2540 2545 2550 



Ala Ala Asp Pro Arg Arg Arg Leu Ala Glu Val Asp Pro Leu Glu 
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2555 



2560 



2565 



Ala Glu Glu Ser Arg Leu Met Leu Ala Ala Gly Glu Glu Pro Ala 
2570 2575 2580 

Pro Ala Leu Pro Glu lie Thr Val Ala Ala Leu Val Ala Glu Gin 
2585 2590 2595 

Cys Ala Arg Thr Pro Gly Ala Val Ala Val Thr Gly Pro Asp Ala 
2600 2605 2610 

Ser Leu Thr Tyr Ala Glu Leu Asp Glu Arg Ala Ala Arg lie Ala 
2615 2620 2625 

Arg Trp Leu Arg Arg His Gly Ala Gly Pro Gly Ala Ala Val Cys 
2630 2635 2640 

Val Leu Met Glu Arg Ser Ala Glu Leu Val Ala Val Leu Leu Gly 
2645 2650 2655 

Val Met Arg Ala Gly Ala Ala Tyr Val Pro Val Asp Pro Ala Tyr 
2660 2665 2670 

Pro Ala * Glu Arg lie Arg Phe Val Val Thr Asp Ala Arg Ala Ala 
2675 2680 2685 

Cys Val Val Ser Glu Ser Ala Ser Ala Gly Leu Val Pro Asp Gly 
2690 2695 2700 

Val Pro Cys Leu Ala lie Asp Asp Pro Ala Ala Ala Ala Glu Pro 
2705 2710 2715 

Ala Glu Pro Gly Asp Asp Pro Gly Asp Ala Ala Gly Pro Arg Pro 
2720 2725 2730 

Asp Asp Pro Ala Tyr lie lie Tyr Thr Ser Gly Ser Thr Gly Thr 
2735 2740 2745 

Pro Lys Gly Val Val Val Ser His Arg Asn Val Val Ala Leu Leu 
2750 2755 2760 

Thr Ala Thr Arg Pro Leu Phe Gly Phe Ala Gly Asp * Glu Val Trp 
2765 2770 2775 

Ser Trp Phe His Ser Val Ala Phe Asp Phe Ser Val Trp Glu Leu 
2780 2785 2790 

Trp Gly Ala Leu Thr His Gly Gly Arg Val Val Val Val Pro Tyr 
2795 2800 2805 

Ala Val Ser Arg Ser Pro Arg Asp Phe Trp Glu Leu Val Val Arg 
2810 2815 2820 

Glu Gly Val Thr Val Leu Ser Gin Thr Pro Ser Ala Phe Ala Gin 
2825 2830 2835 

Leu Met Ala Ala Ala Gly Asp Asp Asp Arg Asp Ala Leu Arg Phe 
2840 2845 2850 



Val Val Phe Gly Gly Glu Ala Leu Asp Pro Gly Arg Leu Ala Gly 
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2855 



2860 



2865 



Trp Leu Ala Arg Arg Pro Asp Lys Pro Arg Leu Val Asn Met Tyr 
2870 2875 2880 

Gly He Thr Glu Thr Thr Val His Thr Thr Tyr Gin His He Ala 
2885 2890 2895 

Pro Gly Thr Thr Gly Ser Val He Gly Arg Gly Leu Pro Gly Phe 
2900 2905 2910 

Gly Leu Tyr Val Leu Asp Glu Ala Leu Arg Pro Val Pro Ala Gly 
2915 2920 2925 

Val Pro Gly Glu Val Tyr Ala Arg Gly Pro Gin Val Ala Arg Gly 
2930 2935 2940 

Tyr He Gly Arg Pro Gly Leu Thr Ala Glu Arg Phe Val Ala Ser 
2945 2950 2955 

Pro Phe Ala Pro Gly Glu Arg Met Tyr Arg Thr Gly Asp Val Ala 
2960 2965 2970 

Arg Trp Thr Ala Asp Gly Arg Leu Val Phe Ala Gly Arg Ser Asp 
2975 2980 2985 

Asp Gin He Lys He Arg Gly Phe Arg He Glu Pro Gly Glu Val 
2990 2995 3000 

Glu Ala Val Leu Ala Ala Gly Pro Gly Val Ser Gin Ala Ala Val . 
3005 3010 3015 

He Val Arg Glu Asp Val Pro Gly Asp Lys Arg Leu Val Ala Tyr 
3020 3025 3030 

Val Val Gly Gly Asp Ala Glu Thr Leu Arg Ser His Ala Gin Gin 
3035 3040 3045 

Arg Leu Pro Gly Tyr Leu Val Pro Ser Ala Phe Val Glu Leu Asp 
3050 3055 3060 

Arg Leu Pro Leu Thr Val Asn Gly Lys Leu Asp Arg Arg Ala Leu 
3065 3070 3075 

Pro Val Pro Asp Tyr Gly Arg Asp Ala Gly Gly Gly Arg Ala Pro 
3080 3085 3090 

Ala Asn Ala Arg Glu Glu Val Leu Cys Arg Ala Phe Ala Glu Val 
3095 3100 3105 

Leu Gly Val Glu Arg Val Gly Val Glu Asp Asp Phe Phe Ala Leu 
3110 3115 3120 

Gly Gly His Ser Leu Leu Val Val Ser Leu Val Glu Arg Leu Arg 
3125 3130 3135 

Arg Gin Gly He Ser Val Pro Val Arg Ala Leu Phe Thr Thr Pro 
3140 3145 3150 



Thr Pro Ala Gly Leu Ala Glu Ala Val Gly Asp Gly Ala Val Val 
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3155 



3160 



3165 



Val Pro Pro Asn Leu lie Pro Glu Asp Ala Ala Glu Leu Thr Pro 
3170 3175 3180 

Glu Met Leu Pro Leu Ala Asp Leu Thr Ala Asp Glu Leu Ala Val 
3185 3190 3195 

Val Val Ala Ser Val Pro Gly Gly Ala Ala Asn He Ala Asp Val 
3200 3205 3210 

Tyr Pro Leu Ala Pro Leu Gin Glu Gly He Phe Phe His His Met 
3215 3220 3225 

Met Ala Asp Arg Asp Ser Ala Asp Val Tyr Val Thr Pro Thr Val 
3230 3235 3240 

Val Glu Phe Asp Ser Arg Asp Arg Leu Asp Gly Phe Leu Ala Ala 
3245 3250 3255 

Leu Gin Gin Val Val Asp Arg Thr Asp Val Tyr Arg Thr Ser Val 
3260 3265 3270 

Val Trp Gin Gly Leu Arg Glu Pro Val Gin Val Val Trp Arg His 
3275 3280 3285 

Ala Arg Leu Pro He Asp Glu Val Glu Leu His Glu Gly Thr Asp 
3290 3295 3300 

Pro Ala Glu Gin Leu He Ala Leu Ala Thr Glu Arg Val Asp Leu 
3305 3310 3315 

Asp Arg Ala Pro Leu He Arg Thr Thr Thr Ala Ala Val Pro Gly 
3320 3325 3330 

Ser Gly Arg Trp Leu Ala Leu Leu Arg He His His Leu Val Gin 
3335 3340 3345 

Asp His Thr Thr Leu Asp Val Leu Leu Gly Glu Leu Arg Ala Phe 
3350 3355 3360 

Leu Glu Gly Arg Gly Asp Glu Leu Pro Glu Pro Val Pro Phe Arg 
3365 3370 3375 

Glu Phe Val Ala Gin Ala Arg Leu Gly Val Pro Arg Glu Glu His 
3380 3385 3390 

Glu Arg Tyr Phe Ala Glu Leu Leu Gly Asp Val Thr Glu Thr Thr 
3395 3400 3405 

Ala Pro Tyr Gly Leu Thr Glu Val His Gly Asp Gly Ser Ala Ala 
3410 3415 3420 

Val His Ser Arg Arg Glu Val Asp Asp Asp Leu Ala Ala Arg Leu 
3425 3430 3435 

His Arg Leu Ala Arg Ser Leu Gly Val Ser Pro Ala Ala Leu Phe 
3440 3445 3450 



His Leu Ala Trp Ala Arg Val Leu Gly Ala Val Ser Gly Arg Asp 
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3455 



3460 



3465 



Asp Val Val Phe Gly Thr Val Leu Phe Gly Arg Met Asn Ser Gly 

3470 3475 3480 

Ala Ala Ala Asp Arg Val Gin Gly Leu Phe lie Asn Thr Leu Pro 

3485 3490 3495 

Val Arg Val Arg Leu Ala Ala Gly Ser Thr Arg Asp Ala Leu Thr 

3500 3505 3510 

Gly Leu Arg Asp Gin Leu Ala Gly Leu Leu Val His Glu His Ala 

3515 3520 3525 

Pro Leu Ala Leu Ala Gin Arg Ala Ala Gly lie Thr Asp Gly Ser 

3530 3535 3540 

Pro Leu Phe Ala Ser lie Phe Asn Tyr Arg His Asn Gin Asp Asp 

3545 3550 3555 

Pro Ala Ala Ser Ala Gly Leu Glu Gly lie Arg Thr Val Tyr Ser 

3560 3565 3570 

Ala Glu His Thr Asn Tyr Pro Leu Asp Ala Ser lie Asp Val Thr 

3575 3580 3585 

Gly Asp Arg Phe Ala lie Thr Val Asn Ala Val Ala Pro Ala Asp 

3590 3595 3600 

Ala Ala Arg lie Ala Glu Leu Met His Thr Cys Leu Gly His Leu 

3605 3610 3615 

Ala Asp Val Leu Glu Asp Ala Pro Glu Thr Pro Leu Ser Trp Val 

3620 3625 3630 

Ser Pro Leu Ser Ala Glu Asp Leu Gly Arg lie Val Gly Asp Trp 

3635 3640 3645 

Asn Glu Thr Arg Arg Ala Val Thr Arg Ala Ser Val Pro Glu Leu 

3650 3655 3660 

Phe Ala Lys Gin Val Ala Ala Thr Pro Asp Ala He Ala Val Ala 

3665 3670 3675 

Gly Glu Gly Val Ser Trp Ser Tyr Arg Glu Leu Asp Val Arg Ser 

3680 3685 3690 

Asp Ala Leu Ala Arg Ser Leu Val Ala Ala Gly Val Gly He Glu 

3695 3700 3705 

Ser Pro Val Val Val Ala Leu Asp Arg Ser Pro Glu Val Pro Thr 

3710 3715 3720 

Ala Phe Leu Ala Val Ala Lys Ala Gly Gly Val Phe Val Pro Val 

3725 3730 3735 

Asp Leu Ser Trp Pro Gin Ala Arg Val Asp Ala Val He Ala Asp 

3740 3745 3750 



Cys Ala Ala Arg Val Ala Val Ala Asp Arg Pro 
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Met Thr Gly Leu 



3755 



3760 



3765 



Thr Val Val Pro Ala Asp Ala Ala Gly Asp Pro Ala Ala Glu Leu 
3770 • 3775 3780 

Pro Pro Arg Pro Leu Pro Gly Ala Glu Val Tyr Arg Met Tyr Thr 
3785 3790 3795 

Ser Gly Ser Thr Gly Arg Pro Lys Gly Val Val Thr Thr His Gin 
3800 3805 3810 

Asn Leu Val Asp Leu Ala Thr Asp Thr Cys Trp Gly Pro Thr Pro 
3815 3820 3825 

Arg Val Leu Phe His Ala Pro His Ala Phe Asp Ala Ser Ser Tyr 
3830 3835 3840 

Glu lie Trp Val Pro Leu Leu Asn Gly Gly Thr Val Val Val Ala 
3845 3850 3855 

Pro Gly Arg Ser lie Asp Ala Ala Val Leu Gly Glu Leu lie Arg 
3860 3865 3870 

Ala His Glu Leu Thr His Val His Val Thr Ala Gly Leu Leu Arg 
3875 3880 3885 

Val Leu Asp Pro Ser Cys Phe Ala Gly Leu Thr Glu Val Leu Thr 
3890 3895 3900 

Gly Gly Asp Ala Val Ser Ala Glu Ala Val Arg Arg Val Met Glu 
3905 3910 3915 

Ala Asn Pro Gly Leu Arg Val Arg Gin Leu Tyr Gly Pro Thr Glu 
3920 3925 3930 

Val Thr Leu Cys Ala Thr Gin Gin Val Leu Asp Gly Thr Gly Val 
3935 3940 3945 

Pro lie Gly Arg Pro Leu Asp Asn Thr Arg Val Tyr Val Leu Asp 
3950 3955 3960 

Asp Leu Leu Gin Pro Val Pro Val Gly Val Thr Gly Glu Leu Tyr 
3965 3970 3975 

Val Ala Gly Ala Gly Leu Ala Arg Gly Tyr Ala Gly Met Pro Gly 
3980 3985 3990 

Leu Thr Ala Glu Arg Phe Val Ala Asp Pro Phe Ser Ser Gly Gly 
3995 4000 4005 

Arg Leu Tyr Arg Thr Gly Asp Leu Val Arg Trp Thr Asp Asp Gly 
4010 4015 4020 . 

Val Leu Val Phe Ala Gly Arg Ala Asp Asp Gin Val Lys lie Arg 
4025 4030 4035 

Gly Tyr Arg Val Glu Pro Gly Glu Val Glu Ala Val Leu Ala Ala 
4040 4045 4050 



His Pro Asp Val Ala Gin Val Ala Val Val Val 
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Arg Glu Asp Thr 



4055 



4060 



4065 



Pro Gly Asp Lys Arg Leu Val Ala Tyr Val Val Gly Gly Asp Val 
4070 4075 4080 

Glu Ala Tyr Ala Gin Glu Arg Leu Pro Gly Tyr Leu Val Pro Ser 
4085 4090 4095 

Ala Phe Val His Leu Asp Ala Leu Pro Leu Thr Ser Asn Gin Lys 
4100 4105 4110 

Val Asp Arg Ala Ala Leu Pro Ala Pro Ser Val Glu Ser Gly Ala 
4115 4120 4125 

Gly Arg Ala Pro Ala Asp Ala Arg Glu Glu Leu Met Cys Ala Ala 
4130 4135 4140 

Phe Ala Glu Val Leu Asp Leu Asp Arg Val Gly Val Asp Asp Asp 
4145 4150 4155 

Phe Phe Ala Leu Gly Gly His Ser Leu Leu Val Val Arg Leu Val 
4160 4165 4170 

Gly Arg He Arg Gin Val Phe Gly Val Glu Val Ser Ala Arg Leu 
4175 4180 4185 

Val Phe Asp Ala Arg Thr Pro Ala Gly Val Val Ala Arg Leu Ser 
4190 4195 4200 

Glu Gly Gly Thr Ala Arg Glu Ala Val Arg Ala Arg Val Arg Pro 
4205 4210 4215 

Ala Arg Val Pro Leu Ser Phe Ala Gin Arg Arg Leu Trp Phe Leu 
4220 4225 4230 

Ser Gin Leu Asp Gly Thr Ser Thr Thr Tyr Asn He Pro Val Ala 
4235 4240 4245 

Leu Gin Leu Asp Gly Pro Leu Asp Arg Asp Ala Phe Thr Ala Ala 
4250 4255 4260 

Leu His Asp Val Val Ala Arg His Glu Val Leu Arg Thr Val Phe 
4265 4270 4275 

Thr Val Ala Asp Gly Glu Pro Trp Gin His He Leu Asp Thr Pro 
4280 4285 4290 

Ser Val Ser Val Pro Val He Glu Val Pro Ala Asp Gly Leu Pro 
4295 4300 4305 

Glu Ala Val Ala Ala Ala Ala Ala His Thr Phe Asp Leu Ser Arg 
4310 4315 4320 

Glu He Pro Leu Arg Ala Val Leu Leu Ala Thr Gly Ala Asp Arg 
4325 4330 4335 

His Val Leu Val Leu Val Val His His He Ala Ala Asp Gly Trp 
4340 4345 4350 



Ser Met Gin Pro Leu Ala Arg Asp Leu Ala Val 
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Ala Tyr Ala Ala 



4355 4360 4365 

Arg He Arg Gly Glu Ala Pro Ala Trp Thr Ala Leu Pro Val Gin 
4370 . 4375 4380 

Tyr Ala Asp Tyr Ala Leu Trp Gin Arg Asp Val Leu Gly Ser Glu 
4385 4390 4395 

His Asp Pro Asp Ser Ala He Ser Gin Gin Val Ala His Trp Arg 
4400 4405 4410 

Arg Gin Leu Ala Gly Ala Pro Asp Glu Leu Pro Leu Pro Ala Asp 
4415 4420 4425 

His Pro Arg Pro Ala Glu Ala Thr Tyr Arg Gly His Thr Val Glu 
4430 4435 4440 

Phe Thr Val Pro Pro Ala Val His His Gin Leu Ala Glu Leu Ala 
4445 4450 4455 

Arg Arg Asn Gly Val Thr Val Phe Met Thr Val Gin Thr Ala Leu 
4460 4465 4470 

Ala Val Leu Leu Ser Lys Leu Gly Ala Gly Thr Asp He Pro He 
4475 4480 4485 

Gly Val Ala Val Ala Gly Arg Thr Asp Pro Thr Leu Asp Asn Leu 
4490 4495 4500 

He Gly Phe Phe Val Asn Thr Leu Val Leu Arg Thr Asp Leu Thr 
4505 4510 4515 

Gly Asn Pro Thr He Thr Asp Leu Leu His Arg Thr Arg Asp Thr 
4520 4525 4530 

. Thr Leu His Ala Phe Thr His Gin Asp Val Pro Phe Glu Lys Leu 
4535 4540 4545 

Val Glu Asp Leu Ala Pro Thr Arg Ser Leu Ala Arg His Pro Leu 
4550 4555 4560 

Phe Gin Val Met Met Thr Leu Gin Ser Thr Gly Arg Ala Gly Glu 
4565 4570 4575 

Ala Ala Glu Leu Pro Gly Leu Glu Thr Ala Val Leu Ser Pro Gly 
4580 4585 4590 

Gly Val Ala Ala Lys Val Asp Leu Asp Leu Ser Leu Ser Glu Ala 
4595 4600 4605 

Tyr Asp Asp Asp Gly Arg Pro Ala Gly Leu Ala Gly Thr Leu Val 
4610 4615 4620 

Ala Ala Ala Asp Leu Phe Glu His Gly Thr Ala Glu Arg He Ala 
4625 4630 4635 

Gly Tyr Leu Ala Arg Leu Leu Ala Val Leu Pro Ala Asp Pro Gly 
4640 4645 4650 



Ala Arg Leu Gly Asp Val Asp Leu Leu Asp Gly Glu Glu Arg Arg 
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4655 



4660 



4665 



Leu Val Leu Thr Gly Trp Asn Asp Thr Thr Ala Ala Val Pro Ala 
4670 4675 4680 

Val Ala Val Pro Glu Leu lie Glu Arg Arg Ala Ala Ala Glu Pro 
4685 4690 4695 

Glu Ala Gly Ala Val Trp Cys Gly Asp Thr His Leu Arg Tyr Gly 
4700 4705 4710 

Glu Leu Asn Ala Arg Ala Asn Arg Leu Ala Arg Leu Leu Val Glu 
4715 4720 4725 

Arg Gly Ala Gly Pro Glu Ser lie Val Ala Val Cys Leu Glu Arg 
4730 4735 4740 

Ser Ala Asp Leu Val Val Thr Leu Leu Ala Val Leu Lys Thr Gly 
4745 4750 4755 

Ala Ala Tyr Leu Pro lie Asp Pro Gly Tyr Pro Ala Gly Arg lie 
4760 4765 4770 

Ala Tyr Met Leu Ala Asp Ala Arg Pro Ala Leu Leu Val Thr Ser 
4775 4780 4785 

Pro Ala Val Ala Ser Gly Asp Ser Leu Pro Asp Gly Gly Ala Gin 
4790 4795 4800 

Arg lie Val Leu Gly Asp Pro Asp Thr Ala Ala Ala Leu Asp Gly 
4805 4810 4815 

Leu Ala Gly Thr Asp Leu Leu Val Ser Glu Arg Arg Gly Val Thr 
4820 4825 4830 

His Pro Ala His Pro Ala Tyr Val lie Tyr Thr Ser Gly Ser Thr 
4835 4840 , 4845 

Gly Arg Pro Lys Gly Val Val Val Pro His Gly Ala Leu Thr Asn 
4850 4855 4860 

Phe Val Ala Ala Met Ser Asp Arg Leu Ala Leu Gly Ala Gly Asp 
4865 4870 4875 

Arg Leu Leu Ala Val Thr Thr Val Ala Phe Asp lie His Val Leu 
4880 4885 4890 

Glu Leu Tyr Val Pro Leu Val Gly Gly Ala Gly Val Val Val Ala 
4895 4900 4905 

Glu Asp Ala Val Val Arg Asp Pro Ala Ala Val Ala Ala Leu Leu 
4910 4915 4920 

Asp Arg His Ala Val Thr lie Val Gin Ala Thr Pro Ala Leu Trp 
4925 4930 4935 

Gin Ala Leu Leu Ala Gly His Ala Asp Ala Val Arg Asp Val Arg 
4940 4945 4950 



Leu Leu Val Gly Gly Glu Ala Leu Pro Pro Ala Leu Ala Gly Arg 
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4955 



4960 



4965 



Met Ala Ala Ala Gly Arg Gly Val Thr Asn Leu Tyr Gly Pro Thr 
4970 4975 4980 

Glu Val Thr Val Trp Ala Thr Val Ala Asp Leu Gly Ala Ser Pro 
4985 4990 4995 

Ala Gly Pro Val Pro lie Gly Thr Pro Leu Arg Asn Thr Arg Ala 
5000 5005 5010 

Phe Val Leu Asp Asp Ala Leu Arg Pro Val Pro Pro Gly Val Pro 
5015 5020 5025 

Gly Glu Leu Tyr Leu Ala Gly Asp Gin Leu Ala Arg Gly Tyr His 
5030 5035 5040 

Gly Arg Ala Gly Leu Thr Ala Glu Arg Phe Val Ala Asp Pro Phe 
5045 5050 5055 

Gly Arg Gly Glu Arg Met Tyr Arg Thr Gly Asp Arg Val Arg Trp 
5060 5065 5070 

Thr Arg Gly Gly Ser Leu Glu Phe Leu Gly Arg Val Asp Asp Gin 
5075 5080 5085 

Val Lys He Arg Gly Phe Arg He Glu Leu Gly Glu Val Glu Ala 
5090 5095 5100 

Ala Leu Ala Ala Phe Gly Pro Val Ala Arg Ala Ala Ala Ala Val 
5105 5110 5115 

Arg Glu Asp Val Pro Gly Asp Arg Arg Leu Val Gly Tyr Val Val 
5120 5125 5130 

Pro Ala Ala Gly Glu Pro Glu Pro Asp Pro Ala Ala Val Arg Ala 
5135 5140 5145 

His Val Ala Ala Gin .Leu Pro Ala Tyr Met Val Pro Ser Ala Val 
5150 5155 5160 

Val Val Leu Pro Asp Leu Pro Leu Thr Ala Asn Gly Lys Leu Asp 
5165 5170 5175 

Arg Lys Ala Leu Pro Ala Pro Asp Tyr Gly Ala Ala Ser Ala Gly 
5180 5185 5190 

Arg Ala Pro Ala Asp Glu Arg Glu Ala Leu He Cys Ala Val Phe 
5195 5200 5205 

Ala Glu Thr Leu Gly Val Thr Asp Val Ala Ala Asp Ala Asp Phe 
5210 5215 5220 

Phe Ala Leu Gly Gly His Ser Leu Leu Ala Val Ser Leu Val Glu 
5225 5230 5235 

Arg Leu Arg Glu His Gly He Ala Val Pro Val Arg Ala Leu Phe 
5240 5245 5250 



Gin Ser Gly Thr Pro Glu Gly Leu Ala Ala Ala Ala Arg Ala Glu 
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5255 



5260 



5265 



Gly Pro Asp Glu Pro Ala Val Pro Ala Asn Gly He Pro Asp Gly 
5270 5275 5280 

Ala Thr Ala Leu Thr Pro Ala Met Leu Thr Leu Val Asp Leu Asp 
5285 5290 5295 

Ala Glu Glu He Ala Arg Val Val Ala Ala Val Pro Gly Gly Ala 
5300 5305 5310 

Ala Asn Val Ala Asp Val Tyr Pro Leu Ala Pro Leu Gin Glu Gly 
5315 5320 5325 

Leu Leu Phe His Ser Leu Met Asp Gly Gly Asp Asp Val Tyr Val 
5330 5335 5340 

Leu Pro Ala Val Leu Gly Phe Asp Ser Arg Ser Arg Leu Asp Ala 
5345 5350 5355 

Phe Leu Ala Ala Leu Gin His Val He Asp Arg His Asp Thr Tyr 
5360 5365 5370 

Arg Thr Ala Val Val His Asp Gly Leu Arg Glu Pro Val Gin Val 
5375 5380 ■ 5385 

Val Trp Arg Arg Ala Thr Leu Pro Val Glu Glu Val Thr Leu Thr 
5390 5395 5400 

Ala Gly Ala Asp Pro Val Gin Glu Leu Leu Ala Thr Ala Pro Val 
5405 5410 5415 

Glu Phe Ala Leu Asp Arg Ala Pro Leu Leu Arg Val Arg Cys Ala 
5420 5425 5430 

Ala Arg Pro Asp Gly Gly Gly Trp Leu Ala Leu Leu Gin He His 
5435 5440 5445 

His Leu Val Gin Asp His Ala Thr Leu Asp Ala Met Leu Ala Glu 
5450 5455 5460 

He Gin Ala Phe Leu Ala Gly Arg Gly Gly Glu Leu Ala Ala Pro 
5465 5470 5475 

Glu Pro Phe Arg Gly Tyr Val Ala Arg Ala Arg Leu Ala Gly Ala 
5480 5485 5490 

Pro Ala Glu His Arg Ala Tyr Phe Ser Arg Leu Leu Gly Asp Val 
5495 5500 5505 

Thr Glu Ser Thr Ala Pro Tyr Gly Leu Thr Asp Ala Arg Asp Ala 
5510 5515 5520 

Arg Pro Thr Gly Lys Ala His Arg Glu Val Asp Arg Arg Leu Ala 
5525 5530 5535 

Ala Arg Val Arg Ala Thr Ala Ser Glu Leu Gly Val Ser Pro Ala 
5540 5545 5550 



Thr Val Phe His Leu Ala Trp Ala Arg Val Leu Gly Thr Leu Ala 
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5555 



5560 



5565 



Gly Arg Asp Asp Val Val Phe Gly Thr Val Leu Leu Gly Arg Leu 
5570 5575 5580 

Gly Ala Gly Ala Arg Ser Gly Arg Ala Leu Gly Pro Phe lie Asn 
5585 5590 5595 

Thr Leu Pro Val Arg Val Arg Leu Ala Ala Ala Gly Ser Arg Glu 
5600 5605 5610 

Thr Leu Ala Gly Leu Arg Ala Gin Leu Ala Glu Leu lie Gly His 
5615 5620 5625 

Glu His Ala Pro Leu Thr Leu Ala Gin Ala Ala Ser Gly Val Pro 
5630 5635 5640 

Gly Gly Thr Pro Leu Phe Thr Ser lie Leu Asn Tyr Arg Gin Gly 
5645 5650 5655 

Pro Pro Ala Gly Asp Asp Thr Gly Asp Glu Glu lie Glu Gly lie 
5660 5665 5670 

Glu Leu Leu Ser Thr Glu Glu Arg Ser Asn Tyr Pro Val Ala Val 
5675 5680 5685 

Ser Val Asp Asp Asp Gly Ser Gly Phe Arg Leu Thr Val Asp Ala 
5690 5695 5700 

Ala Gin Pro Ala Ala Pro Asp Arg Val Ala Glu Leu Leu His Thr 
5705 5710 5715 

Cys Leu His Arg Leu Thr Asp Ala Leu Ala Gly Thr Pro Asp Val 
5720 5725 5730 

Glu Pro Ala Arg lie Asp Val Leu Gly Glu Ala Glu Arg Arg Glu 
5735 5740 5745 

Val Leu Arg Thr Pro Asn Ala Thr Ala Arg Asp Val Ala Ala Ala 
5750 5755 5760 

Thr Leu Pro Ala lie Val Gly Glu Trp Ala Arg Thr Thr Pro Gly 
5765 5770 5775 

Ala Thr Ala Val Thr Ala Glu Asn Asp Arg Leu Thr Tyr Ala Glu 
5780 5785 5790 

Leu Asp Ala Arg Ala Asn Arg Leu Ala Arg Ser Leu lie Ala Arg 
5795 5800 5805 

Gly Val Gly Pro Gly Ala Val Val Gly Met Leu Leu Pro Arg Ser 
5810 5815 5820 

Pro Gly Leu Val Val Ala Met Leu Ala He Val Lys Ala Gly Gly 
5825 5830 5835 

Ala Tyr Leu Pro Leu Asp Pro Gly Tyr Pro Ala Pro Arg Leu Ala 
5840 5845 5850 



Arg Met Val Glu Asp Ala Ala Pro Ala Leu Leu Leu Ala Thr Ala 
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5855 



5860 



5865 



Gly Thr Ala Asp Ala Val Pro Ala Gly Pro Gin Arg Leu Leu Leu 
5870 5875 5880 

Asp Asp Pro Gly Thr Ala Ala Glu Leu Ala Arg Leu Asp Gly Asp 
5885 5890 5895 

Pro lie Arg Asp Glu Glu Arg Thr His Pro Leu Arg Pro Gly His 
5900 5905 5910 

Pro Ala Tyr Leu Met Phe Thr Ser Gly Ser Thr Gly Arg Pro Lys 
5915 5920 5925 

Gly Val Leu Val Pro His Ala Gly lie Asp Arg Met Val Arg Arg 
5930 5935 5940 

Ser Thr Cys Leu Gin Leu Ala Pro Asp Asp Val Leu Pro His Leu 
5945 5950 5955 

Ser Ser Val Ser Phe Asp Ala Ala Thr Phe Glu lie Trp Gly Ala 
5960 5965 5970 

Leu Leu Asn Gly Ala Thr Leu Ala Val Ala Pro Ala Glu Thr Leu 
5975 - 5980 5985 

Ser Val Ala Glu Leu Arg Ala Phe Leu Ala Asp Arg Gly Ala Thr 
5990 5995 6000 

Lys Leu Phe Leu Thr Thr Gly Leu Leu His Glu Val lie Asp Ala 
6005 6010 6015 

Asp Val Thr Ala Leu Ala Gly Leu Lys Ala Val Tyr Thr Gly Gly 
6020 6025 6030 

Asp Val Leu Ser Pro Ala His Cys Arg Ser Leu Leu Asp Arg Val 
6035 6040 6045 

Pro Gly Leu Glu Leu Tyr Asn Ala Tyr Gly Pro Thr Glu Asn Thr 
6050 6055 6060 

Thr lie Thr Thr Leu His Arg Val Arg Pro Glu Asp Leu Asp Ala 
6065 6070 6075 

Gly Thr Gly Val Pro lie Gly Val Pro lie Ser Asp, Thr Arg Val 
6080 6085 6090 

Tyr Val Leu Asp Asp Ala Leu Arg Pro Val Pro Val Gly Val Ala 
6095 6100 6105 

Gly Glu Leu Tyr Thr Ser Gly lie Gly Leu Ala His Gly Tyr Ala 
6110 6115 6120 

Gly Arg Pro Ala Pro Thr Ala Glu Arg Phe Val Ala Cys Pro Phe 
6125 6130 6135 

Ala Pro Gly Glu Arg Met Tyr Arg Thr Gly Asp Leu Val Arg Trp 
6140 6145 6150 



Thr Ala Asp Gly Arg Leu Leu Phe Ala Gly Arg Ala Asp Asn Gin 
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6155 



6160 



6165 



Val Lys lie Arg Gly Phe Arg Val Glu Pro Gly Glu Leu Glu Thr 
6170 6175 6180 

Val Leu Ser Gly His Pro Ala Val Ala Arg Ala Ala Val Leu Ala 
6185 6190 6195 

Arg Glu Asp Thr Pro Gly Ala Lys Arg Leu Val Ala Tyr Val Val 
6200 6205 6210 

Pro Ala Arg Pro Asp Glu Asp Gly Asp Ala Leu Ala Glu Ser Val 
6215 6220 6225 

Arg Ala Tyr Ala Ala Arg Gin Val Pro Asp Tyr Leu Met Pro Ala 
6230 6235 6240 

Ala Thr Val Val Leu Pro Asp Leu Pro Leu Thr Ser Ser Gly Lys 
6245 6250 6255 

Val Asp Arg Ala Ala Leu Pro Ala Pro Asp Val Pro Gly Gly Pro 
6260 6265 6270 

Gly Arg Ala Ala Gly Thr Leu Thr Glu Glu lie Leu Cys Gly Val 
6275 6280 6285 

Phe Ala Gin Val Leu Gly Leu Pro Thr Val Gly Val Asp Asp Asp 
6290 6295 6300 

Phe Phe Ala Ser Gly Gly His Ser Leu Leu Ala Thr Arg Leu Val 
6305 6310 6315 

Ser Arg Leu Arg Ala Val Phe Gly Ala Glu Leu Pro lie Arg Ala 
6320 6325 6330 

Val Phe Glu Ala Pro Thr Pro Ala Thr Leu Ala Thr Arg Leu Gly 
6335 6340 6345 

Ala Ser Ala Pro Arg Arg Leu Ala Leu Gly Glu Arg Ala Arg Pro 
6350 6355 6360 

Glu Asn Val Pro Leu Ser Tyr Ala Gin Arg Arg Leu Trp Phe Leu 
6365 6370 6375 

Asp Arg Leu Glu Gly Gin Asp Gly Thr Tyr Thr lie Pro Leu Thr 
6380 6385 6390 

Val Arg Leu Asp Gly Pro Val Asp Arg Ala Ala Leu Ala Ala Ala 
6395 6400 6405 

Leu Arg Asp Val Leu Glu Arg His Glu Val Leu Arg Thr Val Phe 
6410 6415 6420 

Pro Leu Val Asp Gly Glu Pro Val Gin Arg Val Leu Pro Val His 
6425 6430 6435 

Asp Thr Gly Phe Thr Leu Gly Gly Gly Asp Val Ala Ala Ala Asp 
6440 6445 6450 



Leu Gly Ala Ala Val Ala Glu Ala Thr Ala Gly Thr Phe Asp Leu 
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6455 



6460 



6465 



Ala Ala Glu lie Pro Val Arg Ala Trp Leu Phe Arg Ala Gly Pro 

6470 6475 6480 

Glu Asp His Thr Leu Val Leu Leu Val His His Val Ala Gly Asp 

6485 6490 6495 

Gly Trp Ser Met Thr Pro Leu Ala Arg Asp lie Ala Thr Ala Tyr 

6500 6505 6510 

Asp Ser Arg Arg Glu Ser Arg Ala Pro Gin Trp Glu Pro Leu Pro 

6515 6520 6525 

Val Gin Tyr Ala Asp Tyr Ala Leu Trp Gin Arg Glu Leu Leu Gly 

6530 6535 6540 

Ala Glu Asp Asp Pro Glu Ser Leu Leu Ser Arg Gin Leu Ala Tyr 

6545 6550 6555 

Trp Arg Asp Ala Leu Asp Gly Val Pro Glu Glu Leu Asp Leu Pro 

6560 6565 6570 

Ala Asp Arg Pro Arg Pro Ala Glu Ala Thr His Arg Gly His Glu 

6575 6580 6585 

Val Pro Val Arg Val Pro Ala Glu Val His Arg Arg Leu Ala Glu 

6590 6595 6600 

Leu Ala Arg Ser Glu Gly Val Thr Val Phe Met Val Leu Gin Ala 

6605 6610 6615 

Ala Phe Gly Thr Leu Leu Ser Arg Leu Gly Ala Gly Ala Asp lie 

6620 6625 6630 

Pro lie Gly Thr Ala Val Ala Gly Arg Thr Asp Gin Ala Leu Asp 

6635 6640 6645 

Glu Leu Val Gly Phe Phe Val Asn Thr Leu Val lie Arg Ala Asp 

6650 6655 6660 

Leu Ser Gly Asp Pro Thr Phe Arg Glu Leu Leu Gly Arg Val Arg 

6665 6670 6675 

Ala Thr Gly Leu Ser Ala Tyr Glu His Gin Asp Val Pro Phe Glu 

6680 6685 6690 

Arg Leu Val Glu Val Leu Ala Pro Ala Arg Ser Leu Ala Arg His 

6695 6700 6705 

Pro Leu Phe Gin Val Met Leu Thr Leu Gin Asn Thr Gly Arg Ala 

6710 6715 6720 

Asp Ala Gly Asp Gin Ala Val Pro Pro Ala Ala Gly Ser Ala Ala 

6725 6730 6735 

Ala Lys Phe Asp Leu Glu lie Ser lie Ala Glu Thr Phe Ala Ala 

6740 6745 6750 



Asp Gly Glu Pro Ala Gly Leu Ser Gly Val Leu lie Ala Ala Ala 
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6755 



6760 



6765 



Asp Leu Phe Glu Pro Ala Thr Ala Ala Ala Phe Ala . Glu Arg Leu 
6770 6775 6780 

Ala Arg Val Leu Ala Ala Ala Gly Ala Asp Pro Arg Leu Arg Val 
6785 6790 6795 

Ser Gin Val Asp lie Leu Ser. Ala Glu Glu Arg Glu Ala Val Leu 
6800 6805 6810 

Ser Gly Gly Asn Gly Gly Thr Ala Pro Val Pro Val Thr Thr Val 
6815 6820 6825 

Pro Ala Leu Phe Ala Glu Gin Ala Arg Arg Thr Pro Gly Ala Val 
6830 6835 6840 

Ala Ala Leu Ser Glu Gly Met Ser Leu Thr Tyr Ala Asp Leu Ala 
6845 6850 6855 

Ala Arg Val Asn Arg Leu Ala Arg His Leu Val Ser Leu Gly Ala 
6860 6865 6870 

Gly Pro Glu Thr Val Val Gly lie Ala Met Ser Arg Gly Leu Asp 
6875 6880 6885 

Met Leu Val Ala Val Leu Ala Val Gly Gin Ala Gly Ala Ala Tyr 
6890 6895 6900 

Leu Pro Val Asp Pro Ser Tyr Pro Asp Glu Arg Lys Glu Phe Met 
6905 6910 6915 

Leu Thr Asp Ala Gly Ala Ala Tyr Val Leu Thr Leu Ala Ser Asp 
6920 6925 6930 

Ala Asp Arg Val Pro Pro Gly Thr Pro Ala Ala Ala Val Val Leu 
6935 6940 6945 

Asp Glu Pro Val Thr Ala Ala Arg lie Ala Gly Leu Asp Pro Ala 
6950 6955 6960 

Asp Leu Thr Asp Ala Asp Arg Val Ala Pro Leu Leu Pro Ala His 
6965 6970 " 6975 

Arg Ala Tyr Val lie Tyr Thr Ser Gly Ser Thr Gly Arg Pro Lys 
6980 6985 6990 

Gly Val Ala Val Glu His Arg Thr Val Val Asn Leu Leu Ser Trp 
6995 7000 7005 

Ala Ala Gly Arg Phe Gly Gly Ala Asp Phe Ala Arg Thr Leu Ala 
7010 7015 7020 

Ala Thr Ser Leu Asn Phe Asp Val Ser Val Phe Glu He Phe Gly 
7025 7030 7035 

Pro Leu Val Ser Gly Gly Ser He Glu He Val Thr Asp Leu Leu 
7040 7045 7050 



Ala Leu Ala Asp Pro Ala Ser Pro Ala Trp Glu Ala Ser Leu Val 
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7055 



7060 



7065 



Ser Gly Val Pro Ser Ala Phe Ser Arg Val Leu Asp Arg Gly Asp 
7070 7075 7080 

lie Ala Ala Arg Thr Arg Ser Val Val Leu Ala Gly Glu Ala Leu 
7085 7090 7095 

Thr Ala Asp Val Val Asn Ala Thr Arg Ala Ala Leu Pro Gly Val 
7100 7105 7110 

Arg Val Ala Asn He Tyr Gly Pro Thr Glu Ala Thr Val Tyr Ser 
7115 7120 7125 

Thr Ala Trp His Thr Asp Arg Asp Val Thr Gly Gly Ala Ala Pro 
7130 7135 7140 

He Gly Arg Pro Val Thr Asn Thr Arg Ala Tyr Val Leu Asp Asp 
7145 7150 7155 

Arg Leu Thr Pro Val Pro Pro Gly Val Val Gly Glu Leu Tyr Leu 
7160 7165 7170 

Ala Gly Ala Gin Leu Ala Arg Gly Tyr Leu Gly Arg Pro Gly Leu 
7175 7180 7185 

Thr Gly Glu Arg Phe Val Ala Cys Pro Phe Gly Pro Gly Gly Glu 
7190 7195 7200 

Arg Met Tyr Arg Thr Gly Asp Arg Val Arg Trp Asn Ala Asp Gly 
7205 7210 7215 

Asp Leu Val Phe Ala Gly Arg Ala Asp Asp Gin Val Lys He Arg 
7220 7225 7230 

Gly Phe Arg He Glu Pro Gly Glu Val Gin Ala Val Val Ala Arg 
7235 7240 7245 

Gin Ala Gly Val Ala Arg Ala Val Val Leu Ala Arg Ser Asp Ser 
7250 7255 7260 

Pro Gly Asp Ala Arg Leu Val Ala Tyr Val Val Pro Ala Asp Arg 
7265 7270 7275 

Asp Ala Asp Arg Arg Ala Leu Ala Ala Thr Val Arg Ser Asp Thr 
7280 7285 7290 

Ala Arg Glu Leu Pro Ala Tyr Leu Val Pro Ala Ala Val Val Val 
7295 7300 7305 

Leu Asp Glu Leu Pro Val Thr Ala Asn Gly Lys Leu Asp Arg Arg 
7310 7315 7320 

Ala Leu Pro Ala Pro Gly Leu Ala Glu Ala Gly Ser Gly Arg Gly 
7325 7330 7335 

Pro Val Thr His Arg Glu Glu Val Leu Cys Glu Val Phe Ala Gin 
7340 7345 7350 



Val Leu Gly Leu Pro Ser Val Gly Val Asp Asp Asp Phe Phe Ala 
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7355 



7360 



7365 



Leu Gly Gly His Ser Leu Leu Ala Val Ser Leu Val Glu Gin Leu 
7370 7375 7380 

Arg Arg Arg Gly Val Thr Val Gly Val Arg Ala Leu Phe Gin Thr 
7385 7390 7395 

Pro Thr Val Ala Gly Leu Ala Glu Ala Ala Ala Pro Thr Thr Val 
7400 7405 7410 

Ala Val Pro Pro Asn Leu lie Pro Glu Asp Ala Arg His lie Thr 
.7415 7420 7425 

Pro Gly Leu Leu Pro Leu Val Glu Leu Glu Gin Ala Glu lie Asp 
7430 7435 7440 

Gin Val Val Ala Thr Val Asp Gly Gly Ala Ala Asn Val Ala Asp 
7445 7450 7455 

lie Tyr Pro Leu Ala Pro Leu Gin Gin Gly Met Leu Phe His His 
7460 7465 7470 

Leu Met Ala Gly Asp Asp Gly Glu Asp Val Tyr lie Met Pro Ala 
7475 7480 7485 

Val Val Glu Phe Asp Ser Ala Asp Arg Phe Gly Ala Phe Val Asp 
7490 7495 7500 

Ala Leu Gin His Val lie Asp Arg Asn Asp Val Tyr Arg Thr Gly 
7505 7510 7515 

Val Val Trp Asp Gly Leu Arg Glu Pro Val Gin Val Val Trp Arg 
7520 7525 7530 

Arg Ala Pro Leu Pro Val Thr Glu Val Thr Leu Asp Pro Ala Gly 
7535 7540 7545 

Gly Asp Pro Ala Ala Gin Leu His Ala Ala Ala Gly Ala Arg Met 
7550 7555 7560 

Asp Leu Asn Arg Ala Pro Leu Leu Asp Leu His Val Ala Ala Arg 
7565 7570 7575 

Pro Glu Asp Gly Gin Arg Leu Ala Leu Leu Arg Val His His Met 
7580 7585 7590 

Val Gin Asp His Met Gly Leu Glu Val Leu Leu Gly Glu Val Gin 
7595 7600 7605 

Ala Phe Leu Ala Gly Arg Gly Asp Glu Leu Pro Asp Pro Leu Pro 
7610 7615 7620 

Phe Arg Asp Phe Val Ala Gin Thr Arg Gly Gly Val Pro Glu Ala 
7625 7630 7635 

Glu His Arg Arg Phe Phe Ala Gly Leu Leu Gly Asp Val Thr Glu 
7640 7645 7650 



Pro Thr Ala Pro Tyr Gly Leu Leu 



Asp Val His Arg Asp Gly Val 
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7655 



7660 



7665 



Gly Leu Val Arg Gin Glu Arg Pro Leu Asp Gly Glu Val Val Ala 
7670 7675 7680 

Arg Leu Arg Ala Val Ala Arg Arg Leu Gly Val Ser Pro Ala Thr 
7685 7690 7695 

Val Met His Val Ala Trp Ala Arg Val Leu Gly Val He Ser Gly 
7700 7705 7710 

Arg Asp Asp Val Val Phe Gly Thr Leu Leu Leu Gly Arg Phe Ser 
7715 7720 7725 

Thr Gly Ala Asp Arg Val Pro Gly Pro Phe He Asn Thr Leu Pro 
7730 7735 7740 

Val Arg Ala Arg Leu Gly Gly Thr Gly Ala Ala Ala Ala Val Ala 
7745 7750 7755 

Glu Met Arg Arg Leu Leu Ala Glu Leu Leu Glu His Glu His Ala 
7760 7765 7770 

Pro Leu Thr Thr Ala Gin Gin Ala Ser Gly Leu Ser Gly Asn Leu 
7775 7780 7785 

Pro Leu Phe Thr Ala Leu Phe Asn Tyr Arg His Asn Thr Ser Pro 
7790 7795 7800 

Gly Ala Asp Pro Ser Pro Ala Ala Gly Pro Thr Glu Gly He Arg 
7805 7810 7815 

Pro Val Ser Met Arg Glu Arg Thr Asn Tyr Pro He Ser Val Ala 
7820 7825 7830 

Val Asp Asp Asp Gly Glu Gly Leu Gly Val Ala Val Asn Ala He 
7835 7840 7845 

Pro Pro Val Arg Pro Glu Ala Val Cys Glu Leu Val Ala Thr Ala 
7850 7855 7860 

Thr Glu Ser Leu Thr Ser Ala Leu Glu Leu Phe Leu Asp Gly Gly 
7865 7870 7875 

Pro Asp Thr Ala Val Gly Glu Leu Asp Val Leu Pro Pro Gly Glu 
7880 7885 7890 

Arg Ser Arg Leu Leu Val Glu Trp Asn Asp Thr Ala Arg Pro Val 
7895 7900 7905 

Val Glu Ser Ser Val Pro Ala Leu Phe Ala Glu Arg Val Ala Ala 
7910 7915 7920 

Ala Pro Asp Ala Val Ala Val Val Gly Glu Gly Val Ser Trp Ser 
7925 7930 7935 

Tyr Arg Glu Leu Asp Arg Arg Ser Asp Val Leu Ala Arg Ser Leu 
7940 7945 7950 



Val Ala Ala Gly Val Gly Leu Glu Ser Pro Val Val Val Ala Leu 
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7955 



7960 



7965 



Glu Arg Ser Ala Asp Val Leu Thr Ala Phe Leu Ala Val Ala Lys 
7970 7975 7980 

Ala Gly Gly Val Phe Val Pro Val Asp Leu Ser Trp Pro Gin Thr 
7985 7990 7995 

Arg lie Asp Ala Val lie Ala Asp Ser Arg Pro Val Leu Val Leu 
8000 8005 8010 

Asp Ser Val Asp Leu Pro Ala Ala Glu Ala Asp Leu Pro Arg Val 
8015 8020 8025 

Pro Ala Gly Ala Gly Val Tyr Arg Met Tyr Thr Ser Gly Ser Thr 
8030 8035 8040 

Gly Arg Pro Lys Gly Val Val Thr Thr His Gin Asn Leu Val Asp 
8045 8050 8055 

Leu Ala Thr Asp Thr Cys Trp Gly Ser Thr Pro Arg Val Leu Phe 
8060 8065 8070 

His Ala Pro His Ala Phe Asp Ala Ser Ser Tyr Glu lie Trp Val 
8075 8080 8085 

Pro Leu Leu Asn Gly Gly Thr Val Val Val Ala Pro Arg Arg Ser 
8090 8095 8100 

lie Asp Ala Thr Val Leu Arg Asp Leu Val Arg Gly His Glu Leu 
8105 8110 8115 

Thr His Val His Val Thr Ala Gly Leu Leu Arg Val Leu Asp Pro 
8120 8125 8130 

Ser Cys Phe Ala Gly Leu Thr Glu Val Leu Thr Gly Gly Asp Ala 
8135 8140 8145 

Val Ser Ala Glu Ala Val Arg Arg Val Lys Glu Ala Asn Pro Gly 
8150 8155 8160 

Leu Arg Val Arg Gin Leu Tyr Gly Pro Thr Glu Val Thr Leu Cys 
8165 8170 8175 

Ala Thr Gin His Leu Leu Asp Asp Gly Val Pro lie Gly Arg Pro 
8180 8185 8190 

Leu Asp Asn Thr Arg Val Tyr Val Leu Asp Asp Leu Leu Arg Pro 
8195 8200 8205 

Val Pro Thr Gly Val Val Gly Glu Leu Tyr Val Ala Gly Ser Gly 
8210 8215 8220 

Leu Ala Arg Gly Tyr Ala Gly Met Pro Gly Leu Thr Ala Glu Arg 
8225 8230 8235 

Phe Val Ala Asp Pro Phe Ser Val Gly Gly Arg Leu Tyr Arg Thr 
8240 8245 8250 



Gly Asp Leu Val Arg Trp Thr Asp Asp Gly Val Leu His Phe Ala 
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8255 



8260 



8265 



Gly Arg Ala Asp Asp Gin Val Lys lie Arg Gly Tyr Arg Val Glu 
8270 8275 ' 8280 

Pro Gly Glu Val Glu Ala Val Leu Ala Gin His Pro Asp Val Ser 
8285 8290 8295 

Gin Val Ala Val Val Val Arg Glu Asp Ala Pro Gly Asp Lys Arg 
8300 8305 8310 

Leu Val Ala Tyr Val Val Gly Gly Asp Val Glu Ala Tyr Ala Gin 
8315 8320 8325 

Glu Arg Leu Pro Gly Tyr Met Val Pro Ser Ala Phe Val His Leu 
8330 8335 8340 

Glu Ala Leu Pro Leu Thr Ala Asn Gin Lys Val Asp Arg Ala Ala 
8345 8350 8355 

Leu Pro Ala Pro Glu Arg Glu Thr Thr Thr Pro Gly Lys Ala Pro 
8360 8365 8370 

Ala Pro Gly Pro Leu Gly Asn Leu Glu Glu Ser Met Cys Gin Ala 
8375 8380 8385 

Phe Ala Glu Val Leu Gly Leu Asp Ser Val Gly Pro Asp Asp Asp 
8390 8395 8400 

Phe Phe Ala Leu Gly Gly His Ser Leu Leu Ala Val Ala Leu Val 
8405 8410 8415 

Gin Arg Leu Lys Ala Arg Gly Val Ala Val Thr Val Gin Asp lie 
8420 8425 8430 

Met Ala Ala Pro Thr Val Ser Glu Leu Met Gly Ser Leu Ser Met 
8435 8440 8445 

Ser Ser lie Arg Asp Ser Leu Gly Thr Leu Leu Pro lie Arg Arg 
8450 8455 8460 

Thr Gly Glu Leu Pro Pro Leu Phe Cys Val His Pro Ala Gly Gly 
8465 8470 8475 

Leu Ser Trp Cys Tyr Leu Pro Leu Ala Arg His Val Pro Ala Asp 
8480 8485 8490 

Arg Pro lie Tyr Gly Leu Gin Ala Arg Gly Ala Asp Gly Arg Glu 
8495 8500 8505 

Pro Leu Ala Pro Ser Leu Arg Glu Met Ala Ala Asp Tyr Val Ser 
8510 8515 8520 

Arg Met Arg Ala Val Gin Pro Glu Gly Pro Tyr His Val Leu Gly 
8525 8530 8535 

Phe Ser Phe Gly Val Ala Pro Ala His Glu He Ala Val Gin Leu 
8540 . 8545 8550 



Arg Glu Gin Gly Ala Glu Val Val Leu Val Leu Met Asp Ser Tyr 
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8555 



8560 



8565 



Pro Met Glu Asp Ala Glu Ser Gly Glu Gin Ala Ala Asp Glu Glu 
8570 8575 8580 

Glu Leu Pro Trp Glu Glu Leu lie Glu Ala Glu Phe Gly Arg Val 
8585 8590 " 8595 

Leu Gly Gly Phe Ser Arg Asp Glu Leu Ala Ala Phe Ala Ala Val 
8600 8605 8610 

Phe Arg Asn Asn Thr Lys lie Arg Ala Arg His Arg Leu Gly Arg 
8615 8620 8625 

Phe Asp Gly Asp Ala Leu Leu lie Ala Ser Thr Asp Ser Ala Pro 
8630 8635 8640 

Asp Gly Glu Ser Asn Thr Trp Arg Trp Ala Pro Tyr lie Thr Gly 
8645 8650 8655 

Glu lie Thr Gin Val Val Leu Pro Cys Glu His Thr Asp Leu Val 
8660 8665 8670 

Arg Pro Asp Met Leu Ala Leu Leu Trp Pro Ala Val Glu Ala Trp 
8675 8680 8685 

Gin Ala Gly Arg His Arg Pro 
8690 8695 

<210> 16 
<211> 234 
<212> PRT 

<213> Actinoplanes sp. 
<400> 16 

Met Gin Lys He Pro Leu Val Cys Val Pro Phe Ala Gly Ala Gly Ala 
15 10 15 

Ser Phe Phe His Pro Trp Ala Glu Leu Ala Gly Pro Asp Arg Pro He 
20 25 30 

Val Ala Leu Gin Leu Pro Gly Arg Glu Trp Arg Leu Leu Asp Glu Pro 
35 40 45 

Tyr Ala Asp Val Val Ala Ala Ala Ala Asp Leu Ala Leu Thr Val Ala 
50 55 60 

Asp Glu Val Gly Ala Gly Gly Arg Val Ala Leu Phe Gly His Ser Leu 
65 70 75 80 

Gly Ala Val Leu Ala Tyr Glu He Ala His Ala Leu Val Arg Asp Gly 
85 90 95 

Glu Val Gly Val Glu Arg Leu Phe Val Ser Gly Ser Pro Asp Pro Trp 
100 105 110 

Thr Pro Arg Thr Asn Arg Ala Ser Gly Leu Asp Asp Glu Glu Phe Leu 
115 120 125 



123 



Leu Arg Val Arg Glu Phe Ala Gly Tyr Asp His Glu Ala Leu Ala Asp 
130 135 140 



Pro Asp Met Arg Glu Leu lie Leu Pro Ala Leu Arg Ala Asp Val Glu 
145 150 155 160 

Met His Glu Ser Tyr Val Ala Gly Ser Ala Asp Pro Leu Pro Ala Pro 
165 . 170 175 

Val Thr Ala Leu His Ala Arg Asp Asp Ala Leu Val Ser Ala Glu Gin 
180 185 190 

Thr Ala Gly Trp Ser Lys Ala Thr Ser Gly Pro Phe Gin Leu Val Glu 
195 200 205 

Val Asp Gly Gly His Met Tyr Leu Thr Glu Asp Pro Ala Gly Leu Leu 
210 215 220 

Arg Leu lie Ala Ala Asp Leu Asp Arg Asp 
225 230 

<210> 17 
<211> 274 
<212> PRT 

<213> Actinoplanes sp. 
<220> 

<221> misc_f eature 
<222> <1)..(1) 

<223> V represents a non-standard initiator codon. It is expected that 
the biosynthesized protein will have a f ormylmethionine reidue a 
t this position 



<400> 17 

Val Arg Leu Thr Gly Lys Thr Ala lie Val Thr Gly Ala Ala Arg Gly 
15 10 15 

Leu Gly Arg Ala Cys Ala Val Ala Phe Ala Ala Glu Gly Ala Asp Leu 
20 25 30 

Val Leu Leu Asp Arg Ala Ala Asp Leu Pro Gly Val Pro Tyr Pro Leu 
35 40 45 

Gly Thr Val Gly Gin Leu Glu His Thr Ala Asp Leu Cys Arg Lys Gin 
50 55 60 

Gly Ala Ala Val Leu Thr Val Arg Ala Asp Val Arg Asp Leu Ala Ala 
65 70 75 80 

Leu Thr Ala Ala Ala Asp Arg Ala lie Asp Arg Phe Gly Gly lie Asp 
85 90 95 

Val Leu Val Asn Asn Ala Gly lie Ala Ala Pro Ser Gly Lys Val Thr 
100 105 110 

His Glu lie Thr Glu Asp Glu Trp Gin Leu Met lie Asp Val Asp Leu 
115 120 125 



124 



Ser Gly Ala Trp Arg Met Thr Ala Ala Val Gly Arg His Met Thr Glu 
130 135 140 

Arg Arg Ser Gly Ser He Val Asn He Ala Ser Thr Ala Gly Gin Val 
145 150 155 160 

Gly Tyr Arg His Phe Ala Gly Tyr Val Ala Ala Lys His Gly He Val 
165 170 175 

Gly Leu Thr Arg Ala Ala Ala Leu Asp Tyr Ala Pro Ala Lys Val Arg 
180 -185 190 

Val Asn Ala Val Cys Pro Gly Ser Val Arg Asp Asp Pro Gin Phe Glu 
195 200 205 

Gly Arg Met Leu Ser Glu He Ala Arg Ser Leu Asp Val Pro Val Ala 
210 215 220 

Glu His Glu Gin Thr Phe Leu Gin Ala Gin Pro Met Asn Ala Leu He 
225 230 235 240 

Glu Pro Asp Asp Val Ala Asn Ala Ala He Trp Leu Ala Ser Asp Glu 
245 250 255 

Ser Arg Gin Val Thr Gly Ser Val Val Thr Val Asp Gly Gly Phe Thr 
260 265 270 



Thr Arg 



<210> 18 
<211> 891 
<212> PRT 

<213> Actinoplanes sp. 
<220> 

<221> misc_feature 
<222> (1)..(1) 

<223> V is a non-standard initiator codon. It is expected that the bio 
synthesized protein will have a f ormylmethionine residue at this 
position 



<400> 18 

Val Pro Lys Ser Gin Pro Ala Thr Arg Thr Ala Ala Pro Gly Ala Ala 
1 5 10 15 

Glu Cys His Ala Leu Ala Val Arg Leu Ala Gly Pro He Asp Pro Ala 
20 25 30 

Pro He Glu Arg Arg Leu Ala Ala Arg Met Pro Phe Trp His Glu His 
35 40 45 

Val Ala Ala Arg Pro Gly Asp Glu Ala Ala Leu Arg Arg Arg Glu Arg 
50 55 60 

Glu Leu Ala Arg Pro Val Pro Pro Glu Pro Gly Ala Arg Ala Val Leu 
65 70 , 75 80 



125 



Leu Ala Tyr Ala Asp Gly Ser Ala Asp Leu Val Leu Val Ala Arg Arg 
85 90 95 



Asp Arg Leu Asp Arg Asp Ala Leu lie Ala Leu Ala Arg Pro Glu Arg 
100 105 110 

Ala Pro Arg Gly Arg Lys Pro Ala Glu Pro Asp Ala Pro Pro Pro Ser 
115 120 125 

Ala Ala Pro Ala Trp Gly Leu Gly Asp Gly Gly Pro Asp Asp Arg Trp 
130 135 140 

Ala Glu Leu Arg Val Pro Ala Arg Gly Pro Ala Asp Pro Ala Arg Trp 
145 150 155 160 

Pro Ala Ala Leu Ala Lys Val Leu Ala Arg Tyr Glu Pro Gly Ala Ala 
165 170 175 

Ala Gly Ser Gly Ala Ala Ala Gly Leu Gly Ala Ala Ala Gly Ser Gly 
180 185 190 

Val Ala Ala Gly Ser Ser Ala Ala Ser Gly Ser Gly Ala Ala Ala Val 
195 200 205 

Pro Gly Pro Val Ala Leu Ala Phe Asp Gly Asp Leu Ala Pro Pro Asp 
210 215 220 

Glu Tyr Val Pro Phe Leu Ala Pro Thr His Pro Leu Thr Val Gin Val 
225 230 235 240 

Ser Arg Thr Pro Gly Gly Gly Thr Glu Leu Arg Cys Arg His Arg Leu 
245 250 255 

Gly Ala Val Ser Pro Ala Ala Ala Glu Ala Phe Ala Arg Met Leu Ala 
260 265 270 

Ala Ala His Gly Glu Pro Pro Ala Asp Asp Gly Ala Thr Ala Glu Pro 
275 280 285 

Thr Pro Pro Ala Ala Pro Ala Pro Ala Pro Ala Pro Ala Pro Ala Pro 
290 295 300 

Pro Ala Ala Ala Arg Thr Leu Thr Gly Leu Phe Ala Glu Gin Val Ala 
305 310 315 320 

Ala Arg Pro Thr Ala Val Ala Val Ser Asp Asp Arg Gly Arg His Thr 
325 330 335 

Tyr Arg Glu Leu Asp Glu Trp Ser Gly Arg Leu Ala Arg Gly Leu Arg 
340 345 350 

Lys Ala Gly Val Arg Asp Gly Asp Ala Val Gly Val Cys Leu Asp Arg 
355 360 365 

Ser Ala Glu Leu Val Ala Val Leu Leu Ala Val Leu Lys Ala Gly Ala 
370 375 380 



Ala Tyr Val Pro Leu Asp Ala Ala Tyr Pro Ala Asp Arg lie Ala Tyr 
385 390 395 400 
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Thr Val Gly Asp Ala Gly Leu Ala Val Val Val Thr Thr Ser Ala Asp 
405 410 415 



Phe Pro Asp Val Asp Gly Val Arg Leu Leu Ala Pro Glu Ser Leu Ala 
420 425 430 

Glu Ala Gly Asp Asp Pro Gly lie Pro Leu Ala Thr Pro Ala Gly Pro 
435 440 445 

Glu Arg Pro Ala Tyr Val lie Tyr Thr Ser Gly Ser Thr Gly Arg Pro 
450 455 460 

Lys Gly Val Val Val Pro His Ala Asn Val Ser Ala Leu Leu Asp Ala 
465 470 475 480 

Thr Arg Glu Glu Tyr Ala Leu Gly Pro Gly Asp Val Trp Thr Phe Phe 
485 490 495 

His Ser Ala Ala Phe Asp Phe Ser Val Trp Glu lie Trp Gly Cys Leu 
500 505 510 

Leu Thr Gly Gly His Leu Val Val Val Pro Tyr Trp Val Ser Arg Ser 
515 520 525 

Pro Glu Gin Phe His Asp Leu Leu Ala Glu Arg Gly Val Thr Val Leu 
530 535 540 

Asn Gin Thr Pro Ser Ser Phe Thr Gin Leu Val Ala Ala Asp Arg Gly 
545 550 555 560 

Ala Glu Arg Asp Leu Ala Val Arg Leu Val lie Phe Gly Gly Glu Pro 
565 570 575 

Leu Asp Ala Arg Thr Val Leu Pro Trp Leu Asp Arg Arg Pro Glu Ala 
580 585 590 

Arg Cys Arg Leu Val Asn Met Phe Gly lie Thr Glu Thr Thr Val His 
595 600 605 

Val Thr Ala Val Asp Val Thr Arg Ala Ala Ala Leu Ala Gly Ser Arg 
610 615 620 

Ser Val Gly Arg Pro Leu Pro Gly Trp Ala Val Arg Val Leu Asp Glu 
625 630 635 640 

Gin Arg Arg Glu Val Pro Pro Gly Val Pro Gly Glu He Tyr Val Gly 
645 650 655 

Gly Ala Gly Val Ala He Gly Tyr Leu Asn Arg Pro Glu Leu Thr Ala 
660 665 670 

Glu Arg Phe Val Thr Gly Pro Asp Gly Arg Arg Trp Tyr Arg Ser Gly 
, 675 680 685 

Asp Arg Gly Arg Leu Leu Pro Asp Gly Thr Leu Glu His Leu Gly Arg 
690 695 700 

Leu Asp Asp Gin Val Lys Leu Arg Gly Phe Arg He Glu Leu Asp Glu 
705 710 715 720 
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He Arg Gly Val Leu Thr Glu Cys Ala Gly Val Ala Ala Ala Ala Val 
725 730 735 

Val He Arg Arg Ser Thr Pro Asp Asp Pro Ala Thr Ala Arg Leu Asp 
740 745 750 

Ala Tyr Val Val Ala Glu Ala Gly Ala Thr Pro Pro Val Ala Glu His 
755 760 765 

Ala Ala Arg Met Leu Pro Ala Tyr Met Cys Pro Ala Thr Phe Thr Phe 
770 775 780 

Leu Asp Ala Leu Pro Met Thr Pro Asn Gly Lys Val Asp Lys Ala Ala 
785 790 795 800 

Leu Pro Glu Pro Ala Arg Pro Ala Ala Asp Ala Ala Ala Thr Pro Ala 
805 810 815 

Gly Pro Gly Glu Asp Gly Leu Ala Gly Asp Leu Ala Asp Val Trp Gin 
820 825 830 

Gin Val Phe Gly Cys Pro Val Thr Val Ser Asp Asn Phe Phe Asp Leu 
835 840 845 

Gly Gly Asn Ser Leu Leu Ala Val Arg Met Ala Ala Leu Met Arg Arg 
850 855 860 

Arg Gly Leu Pro Arg Leu His Pro Arg Thr Leu Tyr Leu His Pro Thr 
865 870 875 880 

Val Arg Gly Leu Ala Asp Ala Leu Arg Ser Ala 
885 890 

<210> 19 
<211> 187 
<212> PRT 

<213> Actinoplanes sp. 
<400> 19 

Met Arg Asn Leu Arg Arg Thr Thr Gly He Gly Leu Leu Ala Leu Leu 
15 10 15 

Ser Val Ala Ala Cys Ser Ser Thr Pro Ala Ala Ser Glu Pro Pro Pro 
20 25 30 

Ser Ala Ala Pro Pro Ser Ala Val Thr Ala Thr Gly Pro Ala Ala Glu 
35 40 45 

Lys Ala Val Lys Ser Gly Thr Gin Thr Tyr His Gin Ala Leu Asp Ala 
50 55 60 

Phe Val Ala Ala Ser Asn Lys Gly Thr Thr Asp Thr Thr Glu He Gly 
65 70 75 80 

Lys Tyr Ala Ser Gly Arg Ala Leu Met Thr Phe Gin Gly He Leu Ala 
85 90 95 

Ser Tyr Gin Gin Gin Gly Val His Thr Ser Gly Glu Pro Arg He Asp 
100 105 110 
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Glu Pro Val Val Thr Gly Leu Thr Pro Pro Ala Asp Pro Thr Gly Val 
115 120 125 

Gin Leu Arg Gly Cys lie Asp lie Ser Ala Trp Pro Leu Thr Lys Ala 
130 135 140 

Asp Gly Thr Pro Ala Asp Lys Val Gly Gly Gin Gin Gly Ser Gly Pro 
145 150 155 160 

Ser Ala lie Leu Ala Asn Val Ala Arg Ser Gly Ala Thr Trp Gin Val 
165 170 17-5 

Thr Glu Leu Ala lie Gin Gly Pro Cys Ala Ala 
180 185 

<210> 20 
<211> 415 
<212> PRT 

<213> Actinoplanes sp. 
<220> 

<221> misc_feature 
<222> (1) . . (1) 

<223> V represents a non-standard initiator codon. It is expected that 
the biosynthesized protein will have a f ormylmethionine residue 
at this -position 



<400> 20 

Val Thr Val Arg Arg Trp Leu Pro Ala Gly Leu Thr Val Leu Ala Phe 
1 5 ' 10 15 

Ala Ala Gly Phe Trp Gin Lys Leu Pro Cys -Gin Ala Ala Gly Trp Pro 
20 25 30 

Asp Asp Thr Ala Thr Leu Phe Gly Arg Tyr Cys Tyr Ser Asp Val Pro 
35 40 45 

lie Leu Phe Arg Glu Arg Gly Leu Phe Asp Gly lie Phe Pro Tyr Glu 
50 55 60 

Ser Gly Pro Gly Ala Gin Pro Leu Glu Tyr Pro Val Leu Thr Gly Tyr 
65 70 75 80 

Leu Met Asp Ala Thr Ala Arg Leu Val Arg Ala lie Leu Pro Gly Ala 
85 90 95 

Asp Val Ala Val Ala Ser Arg Ala Tyr Phe Leu Thr Thr Val Leu Val 
100 105 110 

Leu Leu Ala Leu Ala Val Leu Thr Val Trp Ala Thr Gly Ala Val Leu 
115 120 125 

Arg Arg Thr Gly Gly Arg Pro Gly Asp Ala Leu Leu Val Ala Ala Ala 
130 135 140 

Pro Val Leu lie Leu Ala Gly Thr Val Asn Trp Asp Leu Leu Ala Val 
145 150 155 160 
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Ala Ala Ala Val Leu Ala lie Leu Ala Trp Glu Arg Asp Arg Pro Leu 
165 170 175 



Leu Ala Gly Val Leu lie Gly Leu Gly Thr Ala Ala Lys Leu Phe Pro 
180 185 . 190 

Leu Val Leu Leu Gly Pro Val Leu Leu Leu Cys Leu Arg Gin Arg Arg 
195 200 205 

Met Arg Arg Phe Ala Arg Val Ala Ala Gly Ala Ala Gly Ala Trp Leu 
210 215 220 

Leu Val Asn Leu Pro Val Val Ala Leu Gin Pro Asp Gly Trp Met Glu 
225 230 235 240 

Phe Trp Arg Phe Asn Ala Gly Arg Gly Ala Glu Phe Gly Ser Leu Trp 
245 250 255 

Phe Ala Leu Asp Gly Leu Gly Leu His Met Pro Ala Val Asn Ala Val 
260 265 270 

Ala Leu Ala Thr Phe Gly Val Leu Leu Ala Gly lie Ala Val Leu Ala 
275 280 285 

Leu Arg Ser Arg Arg Pro Pro Asp Leu Ala Gin Leu Ala Cys Leu Ala 
290 295 300 

Val Gly Ala Phe Leu Leu Thr Asn Lys Val Tyr Ser Pro Gin Tyr Ala 
305 310 315 320 

Leu Trp Leu Leu Pro Leu Val Val lie Ala Arg Gly Arg Val Pro Arg 
325 330 335 

Trp Pro Val Val Arg Asp Trp Ala Val Trp Gin Ala Ala Glu Val Leu 
340 345 350 

Tyr Trp Leu Ala Val Trp Ser Trp Leu Ala Gly Ser Leu Thr Asp Glu 
355 360 365 

Arg Gin Tyr Ala Trp Ala Thr Val Leu Arg Val Leu Ala Thr Ala Tyr 
370 375 380 

Val Cys Gly Gin Val Val Trp Asp Val Leu Ala Ala Pro Arg Pro His 
385 390 395 400 

Arg Pro Ala Pro Pro Pro Ala Val Ala Glu Pro Ala His Pro Gly 
405 410 415 

<210> 21 
<211> 491 
<212> PRT 

<213> Actinoplanes sp. 
<220> 

<221> misc_f eature 
<222> (1)..{1) 

<223> V represents a non-standard initiator codon. It is expected that 
the biosynthesized protein will have a f ormylmethionine residue 
at this position 
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<400> 21 



Val Ala Ala Gin Pro Glu Glu Phe Asp Val He Val Val Gly Gly Gly 
15 10 15 

Pro Gly Gly Ser Thr Ala Ala Ala Leu Thr Ala Lys Gin Gly Ala Lys 
20 25 30 

Val Leu Leu Leu Glu Arg Glu Lys Phe Pro Arg Tyr Gin He Gly Glu 
35 40 45 

Ser Leu Leu Pro Ser Thr Val His Gly Val Cys Asn Leu Leu Gly Val 
50 55 60 

Gly Asp Glu He Ala Lys Ala Gly Phe Met Arg Lys His Gly Gly Thr 
65 70 75 80 

Phe Lys Trp Gly Thr Ser Thr Glu Pro Trp Thr Phe Thr Phe Ala Thr 
85 90 95 

Ser Pro Arg Met Ala Gly Pro Thr Ser His Ala Phe Gin Val Glu Arg 
100 105 110 

Arg Arg Phe Asp Gin He Leu Leu Glu Asn Ala Arg Arg Leu Gly Val" 
115 120 125 

Asp Val Arg Glu Asn His Pro Val Thr Glu Ala He Ala Asp Asp Glu 
130 135 * -140 

Arg Val Arg Gly Val. Arg Phe Thr Gin Asp Gly Gin Thr Arg Thr Ala 
145 150 155 160 

Leu Ala Arg Phe Val Val Asp Ala Ser Gly Asn Arg Ser Thr Leu His 
165 170 175 

Thr Thr Val Gly Gly Thr Arg Glu Tyr Ser Pro Phe Phe Arg Asn Leu 
180 185 190 

Ala Leu Phe Gly Tyr Phe Glu Asn Gly Arg Arg Leu Pro Ala Pro Asn 
195 200 205 

Ser Gly Asn He Leu Cys Val Ala Phe Gly Ser Gly Trp Phe Trp Tyr 
210 215 220 

He Pro Leu Ser Glu Thr Leu Thr Ser Val Gly Ala Val Val Arg Arg 
225 230 235 240 

Glu Met Ala His Lys Val Gin Gly Asp Gin Glu Lys Ala Leu Phe Glu 
245 250 255 

Leu He Ala Glu Cys Pro Met He Ala Asp Phe Leu Gly Asp Ala Thr 
260 265 270 

Arg Val Thr Glu Gly Asp Tyr Gly Gin He Arg Val Arg Lys Asp Tyr 
275 280 285 

Ser Tyr Ser Ser Thr Ser Tyr Trp Arg Pro Gly Met Cys Leu Val Gly 
290 295 300 
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Asp Ala Ala Cys Phe lie Asp Pro Val Phe Ser Ser Gly Val His Leu 
305 310 315 320 



Ala Thr Tyr Ser Gly Leu Leu Ala Ala Arg Ser lie Asrr Ser Val Leu 
325 330 335 

Ala Gly Thr Val Asp Glu Asp Arg Ala Phe Thr Glu Phe Glu Gin Arg 
340 345 350 

Tyr Arg Arg Glu Phe Gly Val Phe His Asp Phe Leu Val Ser Phe Tyr 
355 360 365 

Asp Met His Val Asp Glu Ser Ser Tyr Phe Trp Ala Ala Arg Lys Val 
370 375 380 

j 

Thr Glu Ser Ser Ala Pro Ala Met Glu Ser Phe Thr Glu Leu Val Gly 
385 390 395 400 

Gly lie Ala Ser Gly Glu Asp Ala Leu Thr Gly Ser Thr Glu Leu Val 
405 410 415 

Arg Arg His Ser Arg Gin Thr Ala Glu Leu Gly Gin Ala Val Ala Gly 
420 425 430 

Leu Glu Glu Gly Gly Thr Gly Phe Leu Arg Gly Ser Ser Val Val Ala 
435 440 445 

Gin Ala Met Phe Glu Gly Ser Gin He Gin Ala Gly Ala He Leu Gly 
450 455 460 

Pro Glu Gly Thr Gin Glu Gin Pro Leu Phe Glu Gly Gly Leu Thr Pro 
465 470 475 480 

Ser Gly Asn Gly Leu Thr Trp Val Ala Ala Asp 
485 490 

<210> 22 
<211> 217 
<212> PRT 

<213> Actinoplanes sp. 
<400> 22 

Met Thr He Arg Val Leu He Ala Asp Asp Gin Ala Met He Arg Ser 
15 10 15 

Gly Leu Arg Leu lie Leu Glu Asp Glu Pro Asp He Glu Val Val Ala 
20 25 30 

Glu Ala Val Asp Gly Val Asp Ala Val Ala Gin Ala Arg Lys Leu Arg 
35 40 45 

Pro Asp Val Cys Leu Val Asp He Arg Met Pro Arg lie Asp Gly He 
50 55 60 

Glu Val Thr Arg Ser Leu Ala Gly Pro Gly Val Val Asn Pro Leu Arg 
65 70 75 80 



Val He Val Val Thr Thr Phe Asp Ser Asp Glu Tyr Val Tyr Gly Ala 
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Leu Arg Gly Gly Ala Val Gly Phe lie Leu Lys Asp Ala Gly Pro Thr 
100 105 110 

Leu Leu Val Glu Ala Val Arg Ala Ala His Lys Gly Asp Ala Leu Val 
115 120 125 

Ser Pro Ser Val Thr Val Arg Leu Leu Asn His Leu Asn Ala Ser Ala 
130 135 140 

Ala Pro Ala Gly Ser Glu Pro lie Pro Leu Ser Asp Arg Glu Leu Glu 
145 150 155 160 

Val Ala Arg Ala lie Ala Arg Gly Arg Thr Asn Gin Glu lie Ala Ala 
165 170 175 

Asp Leu Phe lie Ser Leu Ser Thr Val Lys Gly His Ala Ser Thr lie 
180 185 190 

Gin Ser Lys Leu Gly Val Arg Asn Arg Val Gly Val Ala Ala Trp Ala 
195 200 205 

Trp Glu Asn Arg Leu Val Glu Gly Ser 
210 215 

<210> 23 
<211> 403 
<212> PRT 

<213> Actinoplanes sp. 
<400> 23 

Met Asn He Ala Ala Ala Thr Gly Pro Ala Ala Gly Asp Gly Ala Gly 
15 10 15 

He Arg Thr Leu Gly Ser Val Arg Thr Ala Asp Arg Thr Thr Thr Met 
20 25 30 

Val Ala Asp Ala Gly Leu Ala Val Leu Phe Val Ala Ala Val Val Val 
35 40 45 

Glu Ala Val Ala Val Ala Gin Ser Trp Gly Leu Ala Tyr Trp Leu He 
50 55 60 

Gly Gly Ala Ala Ala Thr Leu Val Cys Leu Leu Ala Leu He Arg Arg 
65 70 75 80 

Arg Gly Pro Val Pro Cys Ala Ala Ala Gly Leu Thr He Ala Ala Gly 



Ala Val Val Thr Ala Ala Val Leu His Met Pro Ala Glu Pro Gly Pro 
100 105 110 

Ala Met Ala Leu Ala Leu Ala Val Leu Thr Gly Ser Ala Val Arg Ala 
115 120 125 

Ala Pro Thr He Pro Ala Phe Ala Val Gly Gly Ala Ala Leu Gly Val 
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90 



95 



130 



135 



140 
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Val Ala Leu Ser Gin Val Ala Ala Ala Thr Trp Asp Ala Gly Pro Ala 
145 150 155 160 

Pro Val Thr Trp Leu Asn lie Leu Thr Trp Leu Gly Gly Thr Ala Thr 
165 170 175 

Gly Leu Ser Leu Arg Thr Val Asp Gly Arg Ala Arg Ala Asn Ala Glu 
180 185 190 

Arg lie Arg Gin Glu Glu Arg Leu Glu Leu Ala Arg Glu Leu His Asp 
195 200 205 

Val Val Ala His His lie Thr Gly Met lie Leu Gin Thr Gin Ala Ala 
210 215 220 

Gin Val Leu Ala Arg Arg Asp Ala Gly Arg Val Pro Glu Arg Leu Ala 
225 230 235 240 

Val He Glu Thr Ala Gly Thr Glu Ala Leu Ala Ala Met Arg Arg Val 
245 250 255 

Val Gly Leu Leu Arg Asp Ala Asp Asp Gly Pro Pro Ser Ala Pro Glu 
. 260 265 270 

Pro Glu Glu Leu Ser Thr Leu Val Glu Arg Phe Ser Arg Gin Gly Gly 
275 280 285 

Pro Val Arg Leu Thr Thr Pro Asp Gly Met Lys Gin Trp Pro He Glu 
290 295 300 

Val Thr Thr Thr Val Tyr Arg He Val Arg Glu Ala Leu Thr Asn Val 
305 310 315 320 

Ala Arg His Ala Pro His Ala Pro Asn Val Thr Val Thr Val Thr Val 
325 330 335 

Glu Gin Ala Asp Glu He Arg Val Glu Val Thr Asn Asp Ala Ala Ala 
340 345 350 

Ala Pro Pro Arg Leu His His Arg Gly Gly Tyr Gly Leu Val Gly Met 
355 360 365 

Arg Glu Arg Val Glu Ser Leu Gly Gly Thr Leu Ser Thr Gly Pro Arg 
370 375 380 

Pro Gly Gly Gly Trp Ser Val Ala Ala Thr Leu Pro Asn Pro Pro Arg 
385 390 395 400 

Glu Arg Arg 



<210> 24 

<211> 309 

<212> PRT 

<213> Actinoplanes sp. 

<400> 24 

Met Lys Ala Met Ser His Glu Arg Ser Thr Pro Val Leu Gin Ala Glu 
1 5 10 15 



134 



Gly Leu Thr Lys Arg Tyr Gly Arg Arg Arg Ala Leu Thr Asp Cys Thr 
20 25 30 

Leu Ser Val Pro Ser Gly Arg Val lie Ala Leu Val Gly Pro Arg Gly 
35 40 45 

Ser Gly Lys Ser Thr Leu Leu Gin Leu Cys Cys Gly Met Val Ala Pro 
50 55 60 

Ser Arg Gly Arg lie Arg Val Leu Gly Glu Arg Pro Asp Ala Gly Ala 
65 70 75 80 

Ala His Leu Ala Arg Val Gly Tyr Val Pro Arg Glu Pro Ala Val Tyr 
85 90 95 

Gly Ser Phe Thr Val Glu Asp His Leu Thr Met Gly Ala Arg Leu Asn 
100 105 110 

Pro Arg Trp Asp Arg Arg Leu Ala Asp Arg Arg lie Ala Ser Ala Gly 
115 120 125 

lie Pro Arg Thr Arg Arg Ala Asp Arg Leu Ser Ala Gly Gin Arg Ala 
130 135 140 

Glu Leu Ala Leu Thr Leu Ala Gly Gly Lys Arg Pro Glu Leu Leu Val 
145 150 155 160 

Leu Asp Glu Pro Gly Ala Val Leu Asp Ala Pro Ala Arg Ala Ser Phe 
165 170 175 

Leu Arg Gly Val Leu Asp Phe Val Ala Glu lie Asp Ala Ser Val Leu 
180 185 190 

lie Ser Gly His Pro Ser Gly Glu Val Glu Arg Leu Cys Asp His Leu 
195 200 205 

lie Val Leu Ser Asp Ser Arg Val Leu Val Ala Gly Asp Val Arg Asp 
210 215 220 

Leu Leu Ala Arg His His Arg lie lie Ala Pro Arg Gly Glu Leu Asp 
225 230 235 240 

Arg Leu Pro Pro Gly Met Glu Pro lie Trp Val Glu Asp Phe Gly Ser 
245 250 255 

Tyr Ser Gly Gly Val Val Arg Ala Glu Val Asp Leu Pro Arg Arg Pro 
260 ' 265 1 270 

Trp Thr Val Glu Arg Val Glu Leu Glu Glu Leu Val Leu Ser Tyr Leu 
275 280 285 

Ser Arg Ala Ser Gly Ala Pro Ala Leu Ala Gly Cys Leu lie Ala Pro 
290 295 300 

Gly Gin Pro' Gly Ser 
305 

<210> 25 
<211> 553 
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<212> PRT 

<213> Actinoplanes sp. 
<220> 

<221> misc_feature 
<222> (1)..(1) 

<223> V represents a non-standard initiator codon. It is expected that 
the biosynthesized protein will have a f ormylmethionine residue 
at this position 



<400> 25 

Val Thr Ala Ala Ala Leu Glu Lys Leu Leu Gly Asp Ala Arg Asp Pro 
15 10 15 

Gly Asn Pro Val Gly Tyr Ala Ala Val Leu Ala Ala Asp Glu Arg Gin 
20 25 30 

Glu Met Leu Ala Glu Gly Glu Arg Leu Leu Asp Arg Tyr Gin Leu Asn 
35 40 45 

Ala Glu Phe Val Pro Val Ala Tyr Gly Gly Arg Leu Ala Arg Ala Asp 
50 55 60 

Arg Leu Ala Glu Val Leu Arg Ala Val Trp Arg Arg Asp Pro Cys Leu 
65 70 75 80 

Gly Leu Gly Tyr Gly Phe Ser Ser Leu lie Ala Ser Val Asn Val Trp 
85 90 95 

Cys Ala Gly Asn Glu Glu Gin Arg Arg Arg Ala Ala Gly Leu Leu Leu 
100 105 110 

Ala Asn Lys Arg lie Ala Ala Ala Phe His Glu Leu Ala His Gly Thr 
115 120 125 

Asp Phe Ser Ala Ala Glu Cys Ala Ala Arg Pro Ala Gly Gly Gly Trp 
130 135 140 

Val Leu Ser Gly His Lys Glu lie Val Thr Asn Leu Arg Arg Ala Glu 
145 150 155 160 

Ala Met Val Leu Phe Ala Arg Thr Gly Glu Ala Arg Gly Ser Arg Ser 
165 ' 170 175 

His Ser Gin Phe Leu Leu Val Arg Asp Glu Leu Pro Ala Ala Arg Ala 
180 185 190 

Val Asp Arg Pro Arg Tyr Pro Gly Ser Gly Met Arg Gly lie Asp Leu 
195 200 205 

Gly Gly Leu Val Phe Asp Asp Cys Pro Val Pro Ser Ser Ala Leu Leu 
210 215 220 

Gly Glu Gin Gly His Gly He Glu Val Ala Leu Arg Ala Tyr Gin Val 
225 230 235 240 

Thr Arg Met Val Ser Pro Ala Leu Leu Val Gly Pro Leu Asp Ser Ala 
245 250 255 
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Val Arg Leu Ala Thr Glu Met Ala Met Glu Arg Arg Leu Tyr Gly Ala 
260 265 270 

Ala Val Ala Asp Leu Pro Tyr Val Arg Thr Thr lie Ala Arg Ala Tyr 
275 280 285 

Ala Ala Leu Leu Thr Val Asp Val Phe Ser Gly Val Gly Leu Arg Ala 
290 295* 300 

Leu His Leu Leu Pro Glu Ala Thr Ala Gly Tyr Ala Pro Ala Val Lys 
305 310 315 320 

Tyr Leu Thr Ala Gin lie Val Leu Asp Ala lie Asp Asp Leu Arg Ser 
325 330 335 

Val Leu Gly Ala Gin Gly Tyr Leu Arg Gin Gly Pro Tyr Ala Met Phe 
340 345 350 

Gin Lys Leu Val Arg Asp Ala Ala Pro Ala Ser Phe Ala His Val Ser 
355 360 365 

Arg Ala Ala Cys Leu Val Met Leu Leu Pro His Leu Pro Arg Leu Ala 
370 375 380 

Arg Arg Ser Trp Thr Ala Glu Glu Pro Pro Pro Asp Asn Val Phe Thr 

385 390 395 400 

r 

Leu Gly Gly Glu Leu Ser Pro Leu Asp Phe Ser Arg Leu Val Ser Gly 
405 410 415 

Met Arg Gly Asp Pro Leu Ala Gly Val Leu His Asp Ser Trp His Asp 
420 425 430 

Glu Gly Pro Val Gly Arg Phe Ala Glu Arg Phe His Arg Glu Leu Thr 
435 440 445 

Gly Leu Arg Asp Ala Cys Arg Glu Leu Gly Pro Ala Asp lie Thr lie 
450 455 460 

Asp Ala Asn Pro Ala Ala Phe Ala Leu Ala Asp Arg Tyr Thr Val Leu 
465 470 475 480 

Leu Ala Ala Ala Cys Ala Leu Gly Val Trp Arg Ala Gly Gly Arg Leu 
485 490 495 

His Arg Pro Ala Leu Leu Ala Val Leu Asp Gly Leu Ala Gly Arg Leu 
500 % 505 510 

Gly Gly Glu Ala Val Leu Ser Val Ala Glu Arg Glu His Val Glu His 
515 520 525 

Gin Leu Phe Glu Met Ala Ala Asp Arg Val Arg Thr Ser Arg Leu Leu 
530 535 540 

Asp Leu Ser Ala Arg Gin Leu Pro Gly 
545 550 

<210> 26 
<211> 585 



137 



<212> PRT 

<213> Actinoplanes sp . 



<400> 26 

Met Thr Val Arg Pro Leu Ala Pro Pro Ala Glu Val Arg Leu Asp Asp 
15 10 15 

Leu Leu Gly Pro Glu Asp Ala Trp Asp Ala Glu Thr Ala Ala Arg Asp 
20 25 30 

lie Ala Glu Glu Phe Pro Ala Arg Leu His Asp Arg Leu Asn Ser Phe 
35 40 45 

Gly Leu Gin Ser Trp Tyr Val Pro Pro Glu Trp Gly Gly Ala Pro Gly 
50 55 60 

Asp His Glu Arg Leu Leu His Leu Trp Arg Ala Val Ala Arg Arg Asp 
65 70 75 80 

Leu Ser Ala Ala Val Ala His Gly Lys Thr Tyr Leu Gly Ser Ala Pro 
85 90 95 

Val Trp Leu Ala Gly Asp Asp Gly Gin Arg Ala Thr Leu Ala Ala Ala 
100 105 110 

Val Leu Ala Gly Thr Pro Val Ala Trp Ala Leu Ser Glu Pro Asp His 
115 120 125 

Gly Ala Asp Leu Leu His Gly Thr Thr Thr Ala Leu Pro His Asp Ala 
130 135 140 

Gly Tyr Arg Leu Arg Gly Leu Lys Trp Pro lie Asn Asn Ala Thr Arg 
145 150 155 160 

Ala Arg Tyr Leu Thr Val Leu Ala Arg Thr Gly Arg Ala Gly Asp Ala 
165 170 175 

Arg Gly Gin Ser Leu Phe Leu Val Asp Lys Glu Ala Leu Ala Pro Gly 
180 185 190 

Thr Trp Leu Pro Arg Pro Lys Val Ala Thr His Gly Val Arg Gly lie 
195 200 205 

Asp lie Ser Gly lie Ala Phe Glu Asp Ala Gly Leu Pro Gly Thr Ala 
210 215 220 

Leu Leu Gly Arg Ala Gly Ser Gly Leu Glu Thr Val Leu Arg Ser Leu 
225 230 235 240 

Gin Leu Thr Arg Thr Met Cys Ala Gly Leu Ser Leu Gly Ala Gly Asp 
245 250 255 

Arg Ala Leu Arg Leu Thr Ala Arg Phe Val Ala Gin Arg Met lie Met 
260 265 270 

Arg Arg Pro Leu Leu Asp Arg Gly His Pro Ala Gly lie Leu Ala Arg 
275 280 285 



Cys Ala Ala Leu Leu Ala Ala Ala Glu Ala Thr Ala Val Val Gly Thr 
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290 295 300 

Arg Ser Val His Ser Leu Thr Ala Glu Met Ser Val Thr Ser Ala He 
305 310 315 320 

Val Lys Ala Tyr Val Pro Thr Val Val Asp Arg Val Leu Arg Glu Leu 
325 330 335 

Ala Glu Leu Leu Gly Ser Arg Ser Phe Leu Arg Asp Glu Tyr Glu His 
340 345 350 

Gly Met Phe Pro Lys Leu Val Arg Asp His His Val Val Ala Val Phe 
355 360 365 

Asp Gly Ser Thr Pro Val Val Arg Thr Ala Leu Ala His Gin Phe Pro 
370 375 380 

Arg Leu Ala Ala Gly Phe Ala Ala Gly Ala Val Ser Ala Glu Gly Leu 
385 390 395 400 

Ala Glu Ala Ser Ala Ala Gly Gin Pro Pro Pro Pro Leu Asp Arg Gly 
405 410 415 

Ala Leu Thr Leu Leu Ser Arg His Gly Cys Ser Val Val Gin Ala Leu 
420 425 430 

Pro Ala Leu Ala Val Ser Ala Ala Val Arg Gly Gly Pro Ala Gly Leu 
435 440 445 

Ala Arg His Ala Ala Ala Leu Ala Gly Glu Ala Arg Arg He Cys Gly 
450 455 460 

Gin Met Thr Glu Leu Gly Pro Ser Ala Arg Pro Ser Met Val Gly His 
465 470 475 480 

Glu Leu Ala Ala Ala Tyr Glu Trp Cys Tyr Ala Gly Ala Ala Cys Leu 
485 490 495 

Leu Leu Trp Thr Ser Ala Glu Gly Arg His Thr Ala Asp Pro Leu Trp 
500 505 510 

Ala Asp Gly Leu Trp Val Leu Ala Ala Leu Arg Ala Val Arg Arg Glu 
515 520 525 

Leu Ala Arg Val Leu Arg Ala Pro Ala Pro Asp Pro Gly Pro His Asp 
530 535 540 

Asp Gly Ala Asp Arg Leu Leu Ala Ala Arg Val Ala Ala Ala Ala Arg 
545 550 555 560 

Thr Gly Glu Pro Val Thr Pro Phe Gly Thr Ala Leu Arg Pro Pro Ala 
565 570 575 

Gly Thr Val Arg Ala Glu Asp Gly Arg 
580 585 

<210> 27 
<211> 587 
<212> PRT 

<213> Actinoplanes sp. 
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<400> 27 



Met Val lie Asp Ala Ala Thr Gin Pro Thr Val Pro Asp Ala Phe Arg 
15 10 15 

Ala Gin Ala lie Ala Arg Pro Gly Glu Pro Ala Leu Val Val Leu Pro 
20 25 30 

Gly Asp Pro Asp Ala Glu Pro Val Thr Leu Thr Tyr Ala Glu Leu Asp 
35 40 45 

Arg Arg Ala Ala Ala Arg Ala Ala Trp Leu Ala Ala Arg Phe Pro Ala 
50 55 60 

Gly Glu Arg lie Leu lie Ala Leu Pro Thr>Gly Ala Glu Phe Val Glu 
65 70 75 80 

Leu Tyr Leu Ala Cys Leu Tyr Ala Gly Leu Val Ala Val Pro Ala Pro 
85 90 95 

Pro Pro Gly Gly Ser Ser Gly Ala Ser Glu Arg Thr Val Gly lie Ala 
100 105 110 

Ala Asp Cys Ser Pro Ala Leu Ala Val Val Asn Ala Asp Asp Ala Ala 
115 120 125 

Pro Leu Thr Ala Val Leu Arg Glu Arg Gly Leu Ser Gly Leu Pro Val 
130 135 140 

Gly Ala Leu Pro Pro Leu Ala Ala Glu Ala lie Arg Pro Pro Arg Gly 
145 150 155 160 

Pro Arg Pro Asp Ser Leu Ala Val Leu Gin Tyr Ser Ser Gly Ser Thr 
165 170 175 

Gly Ser Pro Lys Gly Val Met Leu Ser His Arg Ala Val Leu Ala Asn 
180 185 190 

Leu Arg Ala Phe Asp Arg Ser Ser Gly His Asn Ser Asp Asp Val Phe 
195 200 205 

Gly Ser Trp Leu Pro Leu His His Asp Met Gly Leu Phe Ala Met Leu 
210 215 220 

Thr Ala Gly Leu Leu Asn Gly Ala Gly Val Val Leu Met Ser Pro Thr 
225 230 235 240 

Ala Phe Val Arg Arg Pro Ala Asp Trp Leu Arg Met Met Asp Arg Tyr 
245 250 255 

Arg Val Thr lie Ser Ala Ala Pro Asn Phe Ala Tyr Asp Leu Cys Val 
260 265 270 

Arg Ala Val Arg Asp Glu Gin lie Ala Gly Leu Asp Leu Ser Arg lie 
275 280 285 

Arg Thr Leu Tyr Asn Gly Ser Glu Pro Val Asn Pro Ala Thr Val Arg 
290 295 300 
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Ala Phe Thr Glu Arg Phe Ala Pro Phe Gly Leu His Thr His Ala Val 
305 310 315 320 



Asn Pro Cys Tyr Gly Met Ala Glu Phe Thr Ala Tyr Val Ser Thr Lys 
325 330 335 

Val Phe Glu Ala Pro Ala Val Phe Leu Pro Ala Asp Pro Arg Ala Leu 
340 345 350 

Glu Asp Ala Ala Ser Pro Ala Leu Arg Pro Ala Asp Pro Ala Ala Ala 
355 360 365- 

Arg Glu lie Pro Gly Val' Gly Arg Val Pro Asp Phe Glu Val Leu lie 
370 375 380 

Val Asp Pro Asp Gly Leu Arg Pro Leu Pro Glu Gly Arg Val Gly Glu 
385 390 395 400 

lie Trp Leu Arg Gly Pro Gly Ala Gly Ala Gly Tyr Trp Gly Arg Thr 
405 410 415 

Glu Leu Asn Pro Gly lie Phe Asp Ala Arg Pro Ala Gly Asp Gly Gin 
420 425 430 

Asp Gly Gly Trp Val Arg Thr Gly Asp Leu Gly Ala Leu Thr Gly Gly 
435 440, 445 

Glu Leu Phe Leu Thr Gly Arg Leu Lys Glu Leu Leu lie Val His Gly 
450 455 460 

Arg Asn Leu Ala Pro His Asp Leu Glu Arg Glu Ala Arg Ala Ala His 
465 470 475 480 

Asp Ala Val Asp His Gin He Gly Ala Ala Phe Gly Val Pro Ala Pro 
485 490 495 

Asp Glu Arg He Val Leu Val Gin Glu Val His Pro Arg Thr Pro Leu 
500 505 510 

Asp Glu Leu Pro Arg Val Ala Ser Ala Val Ser Arg Arg Leu Thr Val 
515 v 520 525 

Ser Phe Gly Val Pro Val Arg Asn Val Leu Leu Val Arg Arg Gly Thr 
530 535 540 

Val Arg Arg Thr Thr Ser Gly Lys He Arg Arg Thr Ala Val Arg Glu 
545 550 555 560 

Arg Phe Leu Ala Gly Gly He Thr Ala Leu His Ala Glu Leu Glu Pro 
565 570 575 

Ala Leu Arg Pro Val Gin Ala Gly Ala Gly Arg 
580 585 

<210> 28 
<211> 75 
<212> PRT 

<213> Actinoplanes sp. 
<220> 
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<221> misc_f eature 
<222> (1)..(1) 

<223> V represents a non-standard initiator codon. Ti is expected that 
the biosynthesized protein will have a f ormylmethionine residue 
at this position 



<400> 28 

Val Pro Asn Pro Phe. Glu Asp Pro Asp Ala Asn Tyr Leu Val Leu Val 
15 10 15 

Asn Asp Glu Gly Gin His Ser Leu Trp Pro Val Phe Ala Asp Val Pro 
20 25 30 

Asp Gly Trp Thr Thr Val Phe Gly Glu Ala Gly Arg Gin Asp Cys Leu 
35 40 45 

Asp Tyr lie Glu Lys Ser Trp Thr Asp Met Arg Pro Lys Ser Leu lie 
50 55 60 

Ala Ala Met Glu Lys Gin Lys Gin Pro Gin Ser 
65 70 75 

<210> 29 
<211> 94 
<212> PRT 

<213> Actinoplanes sp. 
<220> 

< 2 2 1 > mi s c_ f ea tur e 
<222> (1) . . (1) 

<223> V is a non-standard initiator codon. It is expected that the bio 
synthesized protein will have a f ormylmethionine residue at this 
position 



<400> 29 

Val Ala Pro Gly Ala Pro Pro Ala Glu His Gly Glu Ala Val Pro Glu 
1 5 1-0 15 

Ala Asp lie Pro Val Leu Arg Asn Arg lie Asp Glu lie Asp Ala Ala 
20 25 30 

lie Met Arg Leu Trp Gin Glu Arg Ala Ser lie Ser Gin Lys lie Gly 
35 40 45 

Ser lie Arg Leu Ala Ser Gly Gly Thr Arg Val Val Leu Ser Arg Glu 
50 55 60 

Gin Glu Val lie Gin Arg Phe Arg Ala Ala Leu Gly Glu Asp Gly Thr 
65 70 75 80 

Thr lie Ala Leu Met Leu Leu Arg Ala Gly Arg Gly Pro Leu 
85 90 

<210> 30 
<211> 619 
<212> PRT 
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<213> Actinoplanes sp . 
<220> 

<221> misc_f eature 
<222> (1)..(1) 

<223> V represents a non-standard initiator codon. It is expected that 
the biosynthesized protein will have a f ormylmethionine residue 
at this position 



<400> 30 

Val Asp Val Pro Arg Val Arg Pro Pro Gly Ala Ala Pro Ala Pro Arg 
15 10 15 

Arg Arg Arg Trp Arg Phe Trp Gin Ser Pro Asp Gly Gin Pro Ala Trp 
20 25 30 

Ala Arg Pro Ala Leu Leu Gly lie Ala Ala Leu Ala Ala Val Leu Tyr 
35 40 45 

Thr Ala Asn Leu Ala Arg Ser Gly Tyr Pro Met Tyr Tyr Ala Val Ala 
50 55 60 

Val Lys Ser Met Ser Val Ser Trp Pro Ala Phe Trp Thr Gly Ala Phe 
65 70 75 80 

Asp Pro Ala Ala Ser lie Thr lie Asp Lys Leu Ala Gly Ala Phe Val 
85 90 95 

Pro Gin Ala Leu Ser Ala Arg Val Phe Gly Phe His Gin Trp Ser Leu 
100 105 110 

Ala Leu Pro Gin Ala Val Glu Gly Val He Ala Val Leu Val Leu Tyr 
115 120 125 

Arg Ala Val Arg Arg Trp His Gly Pro Gly Ala Gly Leu Ala Ala Ala 
130 135 140 

Gly Leu Phe Ala Thr Thr Pro He Val Ser Ser Met Phe Gly His Ser 
145 150 155 160 

Met Glu Asp Gly Ala Leu Thr Leu Cys Leu Val Leu Ala Ala Asp Ala 
165 170 175 

Phe Gly Ala Ala Val Thr Arg Gly Ser Pro Ala Arg Leu Ala Leu Ala 
180 185 190 

Gly Ala Trp He Gly Leu Gly Phe Gin Ala Lys Met Met Gin Ala Trp 
195 200 205 

Leu Val Leu Pro Ala Leu Val Val Thr Tyr Leu Ala Gly Ala Pro Val 
210 215 220 

Arg Ala Arg Ala Arg Val Val His Val Ala Ala Ala Val Ala Ala Thr 
225 230 235 240 

Leu Ala Val Ser Leu Leu Trp Val Leu Ala Leu Thr Leu Leu Pro Gly 
245 250 255 
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Ser His Arg Pro Trp Ala Asp Gly Thr Thr Ser Gly Asn Ala Phe Ala 
260 265 270 

Met Val Phe Gly Tyr Asn Gly Phe Asp Arg Ala Gly lie His Val Pro 
275 280 285 

Gly Ala Leu Thr Thr Gly Phe Thr Asp Gly Gly Ala Ala Ala Gly Gly 
290 295 300 

Ser Trp Thr Ala Leu Ala Ala Asp Arg Leu Ala Thr Gin lie Gly Trp 
305 310 315 320 

Trp Tyr Pro Leu Ala Leu Thr Gly Leu Leu Leu Gly Leu Ala Arg Trp 
325 330 335 

Arg Thr Ala Arg Ala Gly Leu Leu Phe Trp Gly Leu Trp Leu Leu Thr 
340 345 350 

Ala Ala Val Val Leu Ser Arg lie Thr lie Gin His Asn Ala Tyr Leu 
355 360 365 

Ala Val Leu Ala Pro Pro Leu Ala Ala Leu Ala Ala Ala Gly Ala Val 
370 375 380 

Gin Leu Trp Arg Thr His Arg Asp Gly Thr Ala Pro Trp Leu Leu Pro 
385 390 395 400 

Ala Val Val Val Val Gin Ala Gly Trp Thr Leu Trp Leu Ala Thr Arg 
405 410 415 

Tyr Pro Ser Phe Leu Ala Gly Leu Thr Trp Thr Ala Pro lie Ala Ala 
420 425 430 

Val Leu Ala Val Val Val Leu Ala Ala Arg Pro Thr Ala Arg Arg Pro 
435 440 445 

Ala Val Val Val Val Val Ala Gly Leu Leu Ala Val Pro Val Ala Trp 
450 455 460 

Gly Ala Ser Val Leu Asn Pro Arg Tyr Ala Gly Thr Ser Phe Glu Ala 
465 470 475 480 

Gly Ala Gly Pro Ser Gly Pro Val Gly Val Arg Leu Asp Asp Asp Thr 
485 490 495 

Thr Asp Arg Leu Thr Pro Gly Leu Arg Arg Leu Asp Asp Tyr Leu Ala 
500 505 510 

Ala His Arg Asp Gly Arg Thr Tyr Leu Ala Ala Thr Ser Ser Trp Arg 
515 520 525 

Thr Ala Gly Arg Leu lie Val Pro Thr Gly His Ser Tyr Leu Pro Leu 
530 535 540 

Gly Gly Phe Ser Gly Ala Ala Pro Phe Pro Ser Leu Ala Gly Val Gin 
545 550 555 560 

Arg Leu Val Arg Asp Gly Glu Leu Arg Tyr Phe Val Leu Gly Gly Pro 
565 570 575 
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Glu Gly Leu Gly Gly Glu Ala 
580 



Thr Glu Ala Tyr Arg lie Thr Gly Trp 
585 590 



Val Leu Glu Thr Cys Ala Thr Val Pro Pro Ala Glu His Gly Ala Asp 
595 600 605 

Pro Asp Leu Thr Val Leu Arg Cys Asp Lys Pro 
610 615 

<210> 31 
<211> 355 
<212> PRT 

<213> Actinoplanes sp. 
<220> 

<221> misc_f eature 
<222> (1) . . (1) 

<223> V represents a non-standard initiator codon. It is expected that 
the biosynthesized protein will have a f ormylmethionine residue a 
t this position 



<400> 31 

Val Asp Asn Gly Thr Phe Thr Asp Leu Arg lie Asp His lie Glu Phe 
15 10 15 

Ala Val Ala Asp Val Glu Ser Ala Ser Ala Pro Phe Thr Glu Gly Tyr 
20 25 30 

Gly Phe Ser Val Tyr Gly Gly Thr Gly Asp Ala His Ala Pro Val Arg 
35 40 45 

Arg Val Ala Leu Gly Arg Asp Asp lie Arg Leu Val Leu Thr Ala Ala 
50 55 60 

Pro Gly Gly Asp His Pro Ala Met Ala Tyr Val Glu Gin His Gly Asp 
65 70 75 80 

Gly Val Ser Ala lie Ala Leu Ser Thr Arg Asp Ala His Ala Ala Phe 
85 90 95 

Thr Glu Ala Val Arg Arg Gly Ala Val Gly Val Ser Ala Pro Val Thr 
100 105 110 

Gly Asn Gly Val Thr Val Ala Thr lie Arg Gly Phe Gly Asp Val Leu 
115 120 125 

His Thr Phe Val Glu Arg Ala Pro Gly Ala Asp Pro Arg Thr Leu Pro 
130 135 140 

Gly Leu Glu Leu Arg Arg Pro Ser Pro Thr Arg Phe Asp Ser Gly Leu 
145 150 155 160 

Gin Ala lie Asp His lie Ala Val Cys Leu Glu Pro Gly Thr Leu Asp 
165 170 175 



Pro Thr Val Asp Phe Tyr Arg Asp Val Leu Asp Phe Glu Met lie Phe 
180 185 190 
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Glu Glu Arg He Leu Val Gly Arg Gin Ala Met Asp Ser Lys Val Val 
195 200 205 

Gin Ser Arg Ser Gly Gly Val Thr Leu Thr Leu He Glu Pro Asp Thr 
210 215 220 

Ser Leu Glu Gin Gly Gin He Asp Thr Phe Leu Lys Asn His Gly Gly 
225 230 235 240 

Pro Gly Val Gin His Leu Ala Phe He Thr Asp Asp Val Leu Arg Ser 
245 250 255 

Val Gly Arg Met Ser Glu His Gly Val Glu Phe Leu His Thr Pro Asp 
260 265 270 

Ser Tyr Tyr Gly Arg Leu Pro Gly Arg He Pro Gin Ala Gly His Pro 
275 280 285 

He Gin Ala Leu Arg Asp Leu Asn Val Leu Val Asp Gin Asp His Asp 
290 295 300 

Gly Gin Leu Phe Gin He Phe Thr Lys Ser Val His Pro Arg Gly Thr 
305 310 315 320 

He Phe Met Glu Val He Glu Arg Met Gly Ala Arg Ser Phe Gly Ser 
325 330 335 

Gly Asn He Lys Ala Leu Tyr Glu Ala Val Glu Leu Asp Met Ser Lys 
340 345 350 

Gin Ser Ala 
355 

<210> 32 

<211> 429 

<212> PRT 

<213> Actinoplanes sp. 

<400> 32 

Met Glu Ser Pro Ala Thr His Ala Glu Leu Val He Gly Thr Val Leu 
1 5 10 15 

Leu Asp He Ala Leu Val Leu Ala Ala Gly Ala Leu Leu Gly Arg Trp 
20 25 30 

Val Arg Arg Leu Arg Gin Pro Ala Val He Gly Glu He Leu Ala Gly 
35 40 45 

He Ala Leu Gly Pro Ser Leu Leu Gly Leu Leu Pro Gly Asn Pro Thr 
50 55 60 

Ala Trp Leu Phe Pro Ala Glu Ala Arg Pro Tyr Leu Ser Ala Val Ala 
65 70 75 80 

Gin He Gly Leu Ala Leu Phe Thr Phe Leu He Gly Trp Glu Phe Asn 
85 90 95 

Pro Ala Thr Leu Ala Arg His Arg Gly Thr Ala Ala Ala Val Ser lie 
100 105 110 
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Gly Ser He Ala Val Ser Phe Gly Leu Gly He Ala Leu Ala Thr Val 
115 120 125 

Leu His Pro Arg His Asp Thr Thr Gly Gly Gly Lys Val Gly Phe Thr 
130 135 140 

Glu Phe Ala Leu Phe Leu Gly Val Ala Met Ser He Thr Ala Phe Pro 
145 150 155 160 

Val Leu Ala Arg He Leu Ala Glu Arg Arg Leu Thr Gly Thr Arg Val 
165 170 175 

Gly Ser He Ala Leu Val Ser Ala Ala He Asp Asp Val Val Ala Trp 
180 185 190 

Cys Leu Leu Ala Leu Val Thr Ala He Ala Thr Ala Ser Gly Pro Val 
195 200 205 

Gin Leu Val Arg He Leu Ala Leu Leu Ala Val Phe Leu Val Val Leu 
210 215 220 

Val Thr Val Val Arg Pro Leu Leu Val Leu Leu Ala Arg Arg Pro Ser 
225 230 235 240 

Ala Ser Tyr Leu Leu Val Ala Val Val Ala Val Val Leu Leu Ser Ala 
245 250 255 

Tyr Ala Thr Thr Trp He Gly Leu His Ala He Phe Gly Ala Phe Cys 
260 265 270 

Ala Gly Leu Val Met Pro Arg Glu Pro Ala Ala Ala Leu Arg Glu Arg 
275 280 285 

Val Arg Gin Pro Leu Glu His Val Ser Val Val Leu Leu Pro Val Phe 
290 295 300 

Phe He Val Thr Gly Leu Gly Val Asp He Gly Ala Leu Thr Ala Ala 
305 310 315 320 

Asn He Leu Glu Leu Ala Ala He He Val He Ala Cys Ala Gly Lys 
325 330 335 

Leu Ala Gly Ala He Val Pro Ala Val Ser Leu Gly Met Ser Trp Arg 
340 345 350 

Asp Ala Arg Thr Leu Gly Leu Leu Val Asn Thr Arg Gly Leu Thr Glu 
355 360 365 

Leu Val Val Leu Asn Val Gly Leu Gin Leu Ala Val Leu Asp Gly Gin 
370 375 380 

Met Phe Thr Met Met Val Leu Met Ala Leu Val Thr Thr Ala Leu Ala 
385 390 395 400 

Gly Pro Leu He Gly Ser Ala Arg Thr Pro Ala Ala Gly Ala Pro Ala 
405 410 415 



Gin Ala Leu Pro Ala Glu Pro Arg Thr Arg Arg Ala Ala 
420 425 
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<210> 33 
<211> 189 
<212> PRT 

<213> Actinoplanes sp. 
<220> 

<221> misc_f eature 
<222> (1)..(1) 

<223> V represents a non-standard 
the biosynthesized protein 
at this position 



initiator codon. It is expected that 
will have a f ormylmethionine residue 



<400> 33 

Val Ser Asp Glu Ala Ala Val Pro Ser Pro Ala Arg Leu Leu Arg Asp 
15 10 15 

Phe Val Asn Thr Tyr Glu Pro Gin Val Asp Asp Glu Ser Leu Ser Thr 
20 25 30 

Pro Asp Ala Leu Arg Ala Trp Leu Ala Gly Glu Ser Leu Leu Ala Pro 
35 40 45 

Gly Ala Arg Val Arg Pro Ala Asp Leu Ala Arg Ala Val Ala Leu Arg 
50 55 60 

Glu Gly Leu Arg Gin Val Leu Leu Gly His Ala Gly His Pro Ala Asp 
65 70 75 80 

Pro Ala Ala Leu Arg Arg Leu Glu Glu lie Leu Ala Ala Val Pro Val 
85 90 95 

Arg Leu Ser Leu Ala Gly Gly Ala Pro Arg Leu Leu Pro Ala Gly Gly 
100 105 110 

Thr Pro Phe Asp Arg Ala Leu Ala Gly Leu lie Asp Ala Val Arg Gin 
115 120 125 

Cys Ala Glu Leu Gin Val Trp Thr Arg Leu Lys Val Cys Asp Arg Asp 
130 135 140 

Thr Cys Arg Trp Ala Tyr Tyr Asp Ala Ser Arg Asn Gin Ala Arg Arg 
145 150 155 160 

Trp Cys Ser Met Ala Gly Cys Gly Asn Tyr lie Lys Met Arg Arg Ala 
165 170 175 

Tyr Ala Ala Arg Arg Val Arg Gly Ser Ala Gly Ser Ala 
180 185 

<210> 34 
<211> ' 309 
<212> PRT 

<213> Actinoplanes sp. 
<220> 

<221> mi sc_f eature 
<222> (1)..(1) 



148 



<223> 



V represents a non-standard initiator codon. It is expected that 

the biosynthesized protein will have a f ormylmethionine residue 
at this position 



<400> 34 

Val Ala Thr Thr Leu Arg Asp Val Ala Arg Leu Ala Arg Val Ser Val 
15 10 15 

Lys Thr Val Ser Asn Val Val Asn. Asp His Pro His Val Ser Asp Asp 
20 25 30 

Val Arg Arg Arg Val Glu Thr Ala lie Arg Gin Leu Gly Tyr Arg Pro 
35 40 45 

Asn Leu Val Ala Arg Ala Leu Arg Ser Gly Arg Gly Ser Gly Leu Leu 
50 55 60 

Ala Leu Ala Met Pro Gly Ala Gly Ala Pro Gin Ser Pro* Ala Leu lie 
65 70 75 80 

Glu Glu lie lie Arg Arg Ala Ala Pro Leu Gly Phe Arg Val Leu lie 
85 90 95 

Glu Pro Leu Glu Ser Ser Arg Pro Arg Pro Pro Ala Pro Gly Val Asp 
100 105 110 

Ala Arg Leu Leu Asn Ala Glu Ala Pro Ala Pro Glu Leu Val Asp Ala 
115 120 125 

Gin Ala Ala Thr Gly Thr Pro Leu Val Leu Leu Thr Gly Thr Pro Asp 
130 135 140 

Pro Arg Tyr Asp Cys Val Gly Pro Asp Ala Ala Arg Ala Ala Glu Asp 
145 150 155 160 

Ala Val Asp His Leu Arg Arg Leu Gly Arg Arg Arg Val Ala Thr lie 
165 170 175 

Gly Gly Ser Leu Ser Thr Gly Pro Ala Gly Ser Gly Ser Asp Phe Gly 
180 185 190 

Ser Gly Ser Gly Ser Gly Ser Gly Ser Gly Ser Gly Ser Gly Ser Gly 
195 200 205 

Ser Gly Ser Gly Ser Gly Ser Gly Ser Gly Ser Gly Phe Gly Ser Gly 
210 215 220 

Ser Gly Phe Gly Ser Gly Ser Ala Glu Gly Tyr Arg Ala Ala Arg Gin 
225 230 235 240 

Leu Leu Gly His Glu Asp Arg Pro Asp Ala lie Val Cys Gly Ser Val 
245 250 255 

Arg Leu Ala Val Gly Val lie Arg Ala Ala Ala Asp Ala Gly Leu Arg 
260 265 270 

Val Pro Glu Asp Val Ala Val He Gly He Gly Asp Gly Glu Glu Gly 
275 280 285 
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Arg Tyr Thr Arg Pro Ala Leu Thr Thr Val Ala Thr Asp Pro Ala Phe 
290 295 300 



lie Ala Gly Lys Ala 
305 
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